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NUCLEIC ACID MOLECULES ENCODING A TRANSMEMBRANE SERINE 
PROTEASE 25, THE ENCODED POLYPEPTIDES AND METHODS BASED 

THEREON 

RELATED APPLICATIONS 

5 Benefit of priority under 35 U.S.C. § 1 1 9(e) is claimed to U.S. provisional 

application Serial No. 60/328,530, filed October 9, 2001 , to Edwin L. Madison 
and Jiunn-Chern Yeh, entitled "NUCLEIC ACID MOLECULES ENCODING 
TRANSMEMBRANE SERINE PROTEASE 25, THE ENCODED PROTEINS AND 
METHODS BASED THEREON." The subject matter of this application is 
10 incorporated in its entirety by reference thereto. 

This application is also related to International PCT application No. 
PCT/xxxx/xxxxx (attorney docket no. 24745-1 621 PC), filed on the same day 
herewith, entitled "NUCLEIC ACID MOLECULES ENCODING TRANSMEMBRANE 
SERINE PROTEASE 25, THE ENCODED POLYPEPTIDES AND METHODS BASED 
15 THEREON." The disclosure of the PCT application is herein incorporated by 
reference in its entirety. 
FIELD OF INVENTION 

Nucleic acid molecules that encode proteases and portions thereof, 
particularly protease domains are provided. Also provided are prognostic, 
diagnostic and therapeutic methods using the proteases and domains thereof and 
the encoding nucleic acid molecules. 

BACKGROUND OF THE INVENTION AND OBJECTS THEREOF 

Cancer, which is a leading cause of death in the United States, is 
characterized by an increase in the number of abnormal neoplastic cells, which 
25 proliferate to form a tumor mass, the invasion of adjacent tissues by these 

neoplastic tumor cells, and the generation of malignant cells that metastasize via 
the blood or lymphatic system to regional lymph nodes and to distant sites. 
Among the hallmarks of cancer is a breakdown in the communication among 
tumor cells and their environment. Normal cells do not divide in the absence of 
stimulatory signals and cease dividing in the presence of inhibitory signals. 
Growth-stimulatory and growth-inhibitory signals, are routinely exchanged 
between cells within a tissue. In a cancerous, or neoplastic, state, a cell 
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acquires the ability to "override" these signals and to proliferate under conditions 
in which normal cells do not grow. 

In order to proliferate tumor cells acquire a number of distinct aberrant 
traits reflecting genetic alterations. The genomes of certain well-studied tumors 
5 carry several different independently altered genes, including activated 

oncogenes and inactivated tumor suppressor genes. Each of these genetic 
changes appears to be responsible for imparting some of the traits that, in the 
aggregate, represent the full neoplastic phenotype. 

A variety of biochemical factors have been associated with different 
10 phases of metastasis. Cell surface receptors for collagen, glycoproteins such as 
laminin, and proteoglycans, facilitate tumor cell attachment, an important step in 
invasion and metastases. Attachment triggers the release of degradative 
enzymes which facilitate the penetration of tumor cells through tissue barriers. 
Once the tumor cells have entered the target tissue, specific growth factors are 
15 required for further proliferation. Tumor invasion and progression involve a 

complex series of events, in which tumor cells detach from the primary tumor, 
break down the normal tissue surrounding it, and migrate into a blood or 
lymphatic vessel to be carried to a distant site. Degradation and/or remodeling 
of normal tissue barriers is accomplished by the elaboration of specific enzymes 
20 that cleave the proteins of the extracellular matrix that make up basement 
membranes and stromal components of tissues. 

A class of extracellular matrix degrading enzymes has been implicated in 
tumor invasion. Among these are the matrix metalloproteinases (MMP). For 
example, the production of the matrix metalloproteinase stromelysin is 
25 associated with malignant tumors with metastatic potential (see, e.g., McDonnell 
eta/. (1990) Smnrs. in Cancer Biology 7:107-115; McDonnell et aL (1990) 
Cancer and Metastasis Reviews S:309-319). 

The capacity of cancer cells to metastasize and invade tissue is facilitated 
by degradation of the basement membrane. Several proteinase enzymes, 
30 including the MMPs, have been reported to facilitate the process of invasion of 
tumor cells. MMPs are reported to enhance degradation and/or remodeling of 
the basement membrane, which thereby permits tumorous cells to invade 
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tissues. For example, two major metalloproteinases having molecular weights 
of about 70 kDa and 92 kDa appear to enhance ability of tumor cells to 
metastasize. 

Type II Transmembrane Serine Proteases 
5 In addition to the MMPs, serine proteases have been implicated in 

neoplastic disease progression. Most serine proteases, which are either 
secreted enzymes or are sequestered in cytoplasmic storage organelles, have 
roles in blood coagulation, wound healing, digestion, immune responses and 
tumor invasion and metastasis. A class of cell surface proteins designated type 
10 ii transmembrane serine proteases, which are membrane-anchored proteins with 
additional extracellular domains, has been identified. As cell surface proteins, 
they are positioned to play a role in intracellular signal transduction and in 
mediating cell surface proteolytic events. 

Cell surface proteolysis is a mechanism for the generation of biologically 
1 5 active proteins that mediate a variety of cellular functions. Membrane- 
associated proteases include membrane-type metalloproteinases (MT-MMP), 
ADAMs (proteases that contain disintegrin-like and metalloproteinase domains) 
and the transmembrane serine proteases. In mammals, at least 17 members of 
the transmembrane serine protease family are known, including seven in humans 
20 (see, Hooper et aL (2001) J. Biol. Chem. 276:857-860). These include; conn 
(accession nos. AF1 33845 and AB013874; see, Yan et aL (1 999) J. Biol. Chem. 
274:14926-14938; Tomia et aL (1998) J. Biochem. 724:784-789; Uan et aL 
(2000) Proc. NatL Acad. ScL U.S.A. -97:8525-8529); enteropeptidase (also 
designated enterokinase; accession no. U09860 for the human protein; see, 
25 Kitamoto et aL (1995) Biochem. 27\ 4562-4568; Yahagi et'aL (1996) Biochem. 
Biophys. Res. Commun. 2/S:806-812; Kitamoto et aL (1994) Proc. NatL Acad. 
ScL U.S.A. ,97:7588-7592; Matsushima et aL (1 994) J. BioL Chem. 263:19976- 
19982;); human airway trypsin-like protease (HAT; accession no. AB002134; 
see Yamaoka et aL J. BioL Chem. 273:11894-11901); MTSP1 and matriptase 
30 (also called TADG-15; see SEQ ID Nos. 1 and 2; accession nos. 

AF133086/AF1 18224, AF04280022; Takeuchi et aL (1999) Proc. NatL Acad. 
ScL U.S.A. 96-A 1054-1161; Lin et aL (1999) J. BioL Chem. 274:18231-18236; 
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Takeuchi et aL (2000) J. Biol. Chem. 275:26333-26342; and Kim et aL (1999) 
tmmunogenetics 4S:420-429); hepsin (see, accession nos. Ml 8930, AF030065, 
X70900; Leytus et al. (1988) Biochem. 27\ 1 1895-1 1 901; Vu et aL (1997) J. 
Biol. Chem. 272:31315-31320; and Farley et al. (1993) Biochem. Biophys. Acta 
5 7/73:350-352; and see, U.S. Patent No. 5,972,616); TMPRS2 (see, Accession 
Nos. U75329 and AF1 13596; Paoloni-Giacobino et aL (1997) Genomics 44:309- 
320; and Jacquinet et aL (2000) FEBS Lett. 468: 93-100); and TMPRSS4 (see, 
Accession No. NM 016425; Wallrapp et aL (2000) Cancer 50:2602-2606). 

Serine proteases, including transmembrane serine proteases and secreted 
10 proteases, have been implicated in processes involved in neoplastic development 
and progression. While the precise, detailed mechanism by which these 
proteases promote tumor growth and progression has not been elaborated, 
serine proteases and inhibitors thereof are involved in the control of many intra- 
and extracellular physiological processes, including degradative actions in cancer 
1 5 cell invasion, metastatic spread, and neovascularization of tumors, that are 

involved in tumor progression. It is believed that proteases are involved in the 
degradation of extracellular matrix (ECM) and contribute to tissue remodeling, 
and are necessary for cancer invasion and metastasis. The activity and/or 
expression of some proteases have been shown to correlate with tumor 
20 progression and development. 

For example, a membrane-type serine protease MTSP1 (also called 
matriptase; see SEQ ID Nos. 1 and 2 from U.S. Patent No. 5,972,616; and 
GenBank Accession No. AF1 18224; (1999) J. Biol. Chem. 274:18231-18236; 
U.S. Patent No. 5,792,616; see, also Takeuchi (1999) Proc. Natl. Acad. ScL 
25 U.S.A. 96-A 1054-1 161) that is expressed in epithelial cancer and normal tissue 
(Takeucuhi et aL (1999) Proc. Natl. Acad. Sci. USA 55:11054-61) has been 
identified. Matriptase was originally identified in human breast cancer cells as a 
major gelatinase (see, U.S. Patent No. 5 r 482 f 848) and was initially believed to 
be a type of matrix metalloprotease (MMP). It has been proposed that it plays a 
30 role in the metastasis of breast cancer. Matriptase also is expressed in a variety 
of epithelial tissues with high levels of activity and/or expression in the human 
gastrointestinal tract and the prostate. MTSPs, designated MTSP3, MTSP4, 
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MTSP6 have been described in published Internationa! PCT application No. WO 
01/57194, based in International PCT application No. PCT/US0 1/03471 . 

Prostate-specific antigen (PSA), a kallikrein-like serine protease, degrades 
extracellular matrix glycoproteins fibronectin and laminin, and, has been 
5 postulated to facilitate invasion by prostate cancer cells (Webber eta/. (1995) 
Clin. Cancer Res. 7:1 089-1 094). Blocking PSA proteolytic activity with 
PSA-specific monoclonal antibodies results in a dose-dependent decrease in vitro 
in the invasion of the reconstituted basement membrane Matrigel by LNCaP 
human prostate carcinoma cells which secrete high levels of PSA. 
10 Hepsin, a cell surface serine protease identified in hepatoma cells, is 

overexpressed in ovarian cancer (Tanimoto et aL (1997) Cancer Res., 
57:2884-2887). The hepsin transcript appears to be abundant in carcinoma 
tissue and is almost never expressed in normal adult tissue, including normal 
ovary. It has been suggested that hepsin is frequently overexpressed in ovarian 
15 tumors and therefore can be a candidate protease in the invasive process and 
growth capacity of ovarian tumor cells. 

A serine protease-like gene, designated normal epithelial cell-specific 1 
(NES1) (Liu etal. Cancer Res. . 56(141 :3371-9 (1996)) has been identified. 
Although expression of the NES1 mRNA is observed in all normal and 
20 immortalized nontumorigenic epithelial cell lines, the majority of human breast 
cancer cell lines show a drastic reduction or a complete lack of its expression. 
The structural similarity of NES1 to polypeptides known to regulate growth 
factor activity and a negative correlation of NES1 expression with breast 
oncogenesis suggest a direct or indirect role for this protease-like gene product 
25 in the suppression of tumorigenesis. 

Hence transmembrane serine proteases appear to be involved in the 
etiology and pathogenesis of tumors. There is a need to further elucidate their 
role in these processes and to identify additional transmembrane proteases. 
Therefore, among the objects herein, it is an object herein to provide 
30 transmembrane serine protease (MTSP) proteins and nucleic acids encoding 
such MTSP proteases that are involved in the regulation of or participate in 
tumorigenesis and/or carcinogenesis. Also, among the objects herein, it is also 
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an object herein to provide prognostic, diagnositic and therapeutic methods 

using such proteases and the acids encoding such proteases. 

SUMMARY 

Provided herein are transmembrane serine proteases (MTSPs) and nucleic 
5 acids encoding such MTSP proteases that are involved in the regulation of or 
participate in tumorigenesis and/or carcinogenesis. Also, provided are 
prognostic, diagnostic, and therapeutic methods using such proteases and the 
nucleic acids encoding such proteases. 

In particular, provided herein is the protease domain of a polypeptide 
10 designated herein as MTSP25. The protease domain and full-length protein, 

including the zymogen and activated forms, and uses thereof are also provided. 
Polypeptides encoded by splice variants are also provided. Hence, provided 
herein is a family proteins designated MTSP25, and functional domains, 
including one or more of a transmembrane (TM) domain, and a serine protease 
15 catalytic domain, especially protease (or catalytic) domains thereof. Also 

provided are muteins and other derivatives and analogs thereof. Also provided 
herein are nucleic acids encoding the MTSP25s. MTSP25s are type I 
transmembrane serine proteases. 

The protease domains provided herein include, but are not limited to, the 
20 single chain region having an N-terminus at the cleavage site for activation of the 
zymogen, through the C-terminus, or C-terminal truncated portions thereof that 
exhibit proteolytic activity as a single-chain polypeptide in in vitro proteolysis 
assays of MTSP25, from a mammal, including a human, that, for example, 
displays functional activity in tumor cells that is different from its activity non-in 
25 non-tumor cells. 

MTSP25 is a member of the family of the Transmembrane Serine 
Protease family, particularly the Type I Transmembrane Serine Protease family 
(also referred to herein as MTSPs), and more particularly MTSP family members 
whose functional activity differs in tumor cells from non-tumor cells in the same 
30 tissue. MTSP25 may also be expressed as a secreted protein that includes the 
protease domain. Type I integral membrane proteins have an N-terrminal signal 
peptide that is cleaved, generating a new N-terminus on the non-cytoplasmic 



WO 03/031585 



PCTYUS02/32417 



-7- 



side of the membrane, and remain anchored through the C-terminus. In the 
disclosed MTSP protein, the signal peptide sequence encompasses residues 1-16 
SEQ ID No. 16. This protein is anchored to the membrane through a C-terminal 
transmembrane domain (residues 324-346 SEQ ID No. 16). 
5 Nucleic acid molecules encoding the proteins and protease domains are 

also provided. Nucleic acid molecules that encode a single-chain protease 
domain or catalytically active portion thereof and also those that encode the full- 
length MTSP25 or portions thereof are provided. In one embodiment, a nucleic 
acid that encodes a MTSP, designated MTSP25 is provided. The nucleic acid 
10 molecule includes the sequence of nucleotides set forth in SEQ ID No. 5 that 
encode amino acids 1-237 of SEQ ID No. 6 (residues 78-314 of SEQ ID No. 16) 
or a catalytially active portion thereof (see, also EXAMPLE 1) that encodes a 
catalytically active polypeptide or a domain thereof. Also included are 
sequences of nucleotides in which there is a G to A polymorphism at nucleotide 
15 379 of SEQ ID No. 15, giving rise to a Thr at amino acid 127. Variants of these 
sequences of nucleotides and amino acids as described herein are also provided. 

Also provided are nucleic acid molecules that encode ail or a portion of a 
catalytically active polypeptide, or a nucleic acid molecule that encodes the 
protease domain or a larger polypeptide that can include up to the full length 
20 polypeptide and that hybridizes to such MTSP25-encoding nucleic acid along 

their full-length or along at least about 70%, 80% or 90% of their full-length and 
encode the protease domain or portion thereof are provided. Hybridization is 
generally effected under conditions of at least low, generally at least moderate, 
and often high stringency. 

The isolated nucleic acid fragment is DNA, including genomic or cDNA, or 
is RNA, or can include other components, such as pepeptide nucleic acid or 
other nucleotide analogs. The isolated nucleic acid may include additional 
components/such as heterologous or native promoters, and other transcriptional 
and translational regulatory sequences, these genes may be linked to other 
30 genes, such as reporter genes or other indicator genes or genes that encode 
indicators. 
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Also provided is an isolated nucleic acid molecule that includes the 
sequence of molecules that is complementary to the nucleotide sequence 
encoding MTSP25 or a portion thereof. 

Also provided are nucleic acid molecules that hybridize under conditions 
5 of at least low stringency, generally moderate stringency, more typically high 
stringency to the sequence of nucleotides set forth in SEQ ID No. 5 or SEQ ID 
No. 15 or degenerates thereof. In one embodiment, the isolated nucleic acid 
fragment hybridizes to a nucleic acid molecule containing the nucleotide 
sequence set forth in SEQ ID No. 5 or SEQ ID No. 15 (or degenerates thereof) 
10 under high stringency conditions. In one embodiment, it contains the sequence 
of nucleotides set forth in SEQ ID No. 5. A full-length MTSP25 polypeptide 
includes the sequence of amino acids set forth in SEQ ID No. 6 or SEQ ID No. 
16, and is encoded by a sequence of nucleotides set forth in SEQ ID No. 5 or 
SEQ ID No. 15 or degenerates thereof. Methods for isolating nucleic acid 
1 5 encoding other MTSP25s, including nucleic acid molecules encoding f ulMength 
molecules and splice variants and MTSPs from species, such as cows, sheep, 
goats, pigs, horses, primates, including chimpanzees and gorillas, rodents, dogs, 
cats and other species of interest, such as domesticated animals, farm and zoo 
animals are also provided. The nucleic acid molecules provided herein, including 
20 those set forth in SEQ ID Nos. 5 and 16 can be used to obtain nucleic acid 
molecules encoding full-length MTSP25 polypeptides from human sources or 
from other species, such as by screening appropriate libraries using the nucleic 
acid molecules or selected primers or probes based thereon. 

Also provided are fragments thereof or oligonucleotides that can be used 
25 as probes or primers and that contain at least about 10, 14, 16 nucleotides, 

generally less than 1000 or less than or equal to 100, set forth in SEQ ID No. 5 
or SEQ ID No. 15 (or the complement thereof); or contain at least about 30 
nucleotides (or the complement thereof) or contain oligonucleotides that 
hybridize along their full-length (or at least about 70, 80 or 90% thereof) to any 
30 such fragments or oligonucleotides. The length of the fragments are a function 
of the purpose for which they are used and/or the complexity of the genome of 
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interest. Generally probes and primers contain less than about 30, 50, 1 50 or 
500 nucleotides. 

Also provided are plasmids containing any of the nucleic acid molecules 
provided herein. Cells containing the plasmids are also provided. Such cells 
5 include, but are not limited to, bacterial cells, yeast cells, fungal cells, plant cells, 
insect cells and animal cells. 

Methods of expressing the encoded MTSP25 polypeptide and portions 
thereof using the cells are also provided, as are cells that express MTSP25 on 
the cell surface. Such cells are used in methods of identifying candidate 
10 therapeutic compounds. 

MTSP25, particularly the protease domain thereof, can be produced by 
growing the above-described cells under conditions whereby the MTSP25 is 
expressed by the cells, and recovering the expressed MTSP25 polypeptide. 

Also provided are cells, generally eukaryotic cells, such as mammalian 
15 cells and yeast cells, in which the MTSP25 polypeptide is expressed on the 
surface of the cells. Such cells are used in drug screening assays to identify 
compounds that modulate the activity of the MTSP25 polypeptide. These 
assays, including in vitro binding assays, and transcription based assays in 
which signal transduction is mediated directly or indirectly, such as via activation 
20 of pro-growth factors, by the MTSP25 is assessed. 

Also provided are peptides that are encoded by such nucleic acid 
molecules. Included among those polypeptides are the MTSP25 protease 
domain or a polypeptide with amino acid changes such that the specificity and/or 
protease activity remains substantially unchanged. In particular, a substantially 
25 purified mammalian MTSP25 polypeptide is provided that includes a serine 
protease catalytic domain and may additionally include other domains. The 
MTSP25 can form homodimers and can also form heterodimers with some other 
protein, such as a membrane-bound protein. Also provided is a substantially 
purified protein including a sequence of amino acids that has at least 60%, 70%, 
30 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 
93%, 94%, 95%, 96%, 97%, 98%, 99% identity to the MTSP25 where the 
percentage identity is determined using standard algorithms and gap penalties 
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that maximize the percentage identity. A human MTSP25 polypeptide is 
exemplified, although other mammalian MTSP25 polypeptides are contemplated. 
Splice variants of the MTSP25, particularly those with a proteolytically active 
protease domain, are contemplated herein. 
5 In other embodiments, substantially purified polypeptides that include a 

protease domain of a MTSP25 polypeptide or a catalytically active portion 
thereof are provided. Among these are polypeptides that include a sequence of 
amino acids that has at least 60%, 70%, 80%, 85%, 90%, 95% or 100% 
sequence identity to SEQ ID No. 6 or SEQ ID No. 16 or to a portion thereof that 
10 includes a catalytically active polypeptide. 

Also provided are muteins of the single chain protease domain of 
MTSP25 particularly muteins in which the Cys residue in the protease domain 
that is free (i.e., does not form disulfide linkages with any other Cys residue in 
the protease domain) is substituted with another amino acid substitution, 
1 5 typically, although not necessarily, with a conservative amino acid substitution 
or a substitution that does not eliminate the activity, and muteins in which a 
glycosylation site(s) is eliminated. 

Hence muteins in which one or more of the Cys residues, particularly, a 
residue that is paired in the activated two chain form, but unpaired in the 
20 protease domain alone [i.e., the Cys at residue position 191 (see SEQ ID Nos. 15 
and 16) in the protease domain), is/are replaced with any amino acid, typically, 
although not necessarily, a conservative amino acid residue, such as Ser, are 
contemplated. Muteins of MTSP25, particularly those in which Cys residues, 
such as the unpaired Cys in the single chain protease domain, is replaced with 
25 another amino acid that does not eliminate the activity, are provided. Muteins in 
which other conservative or non-conservative amino acid substitutions in which 
catalytic and/or binding activity is retained are also contemplated (see, e.g., 
Table 1, for exemplary amino acid substitutions). 

MTSP25 polypeptides, including, but not limited to splice variants 
30 thereof, and nucleic acids encoding MTSPs, and domains, derivatives and 

analogs thereof are provided herein. Single chain protease domains that have an 
N-terminus functionally equivalent to that generated by activation of the 
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zymogen form of MTSP25 are also provided. The cleavage site for the protease 
domain of MTSP25 is between amino acid R and amino acid I (FUIIGGT) (see 
SEQ ID NO. 6, residues 1-5 (IIGGT); residues 77-82 SEQ ID No. 16 (RIIGGT). 
Thus, MTSP25 contains a trypsin-like serine protease domain (residues 78-323, 
5 or C-terminal truncated variants (78-313, 314, 315, 316 . . . 322, or shorter 
variants that are catalytically active, following cleavage) characterized by the 
presence of a protease activation cleavage site (...R 77 H 7Q IGG..., where * 
indicates a protease activation/cleavage site) at the amino terminus of the 
domain and catalytic triad residues (H 122 , D 17l and S 26a ) in three highly-conserved 

10 regions. In addition, two other domains, a signal peptide sequence (aa 1-aa 16 
SEQ ID No. 16) and a C-terminal transmembrane domain (aa 324-aa 346 SEQ 
ID No. 17), occur at the amino and carboxy termini, respectively, of MTSP25. 
There are 2 potential /V-glycosylation sites: N 219 AT and N 249 TS. There is an 
unpaired C 191 in the serine protease domain that can pair with C 64 outside of the 

1 5 serine protease domain. Within the serine protease domain, the following 
cysteine pairings occur: C 107 -C 123 , C 206 -C 274 , C 237 -C 253 , and C 284 -C 294 . One 
cysteine (C 340 ) within the C-terminal transmembrane domain is apparently 
unpaired. Alignment {e.g. , b/astp; http://www.ncbi.nlm.nih.gov/BLAST) of 
MTSP25 protein sequence with that of enterokinase (GenBank accession number 

20 U09860) indicates that the two proteins share 41 % sequence identity in their 
protease domains. 

As noted the protease also can be provided as a two chain molecule. 
Single chain and two chain forms of MPTSP25 can be proteolytically active. A 
two chain form of the polypeptide set forth in SEQ ID No. 16 is provided; smaller 

25 catalytically active two chain forms are also provided. A two chain form is 

produced by activation cleavage due to covalent bonding, typically between the 
C 191 and a Cys outside the protease domain, such as Cys 64 . Upon activation 
cleavage the bond remains resulting in a two chain polypeptide. The size of 
chain "A" is a function the starting length of the polypeptide prior to activation 

30 cleavage between the R 77 and l 7a . Any length polypeptide that includes the 
protease domain (at least residues 78-313 up to 323 of SEQ ID No. 16) or 
cataiyticaiiy active fragments thereof, is contemplated herein. Two chain forms 
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include at least the protease domain of a polypeptide from C 64 up to and 
including C 191 . Such two chain forms include those in which the "A" chain 
contains amino acids 17-77 <and shorter fragments that can be incrementally 
shorter to amino acids 64-77), and the "B" chain contains acids 78-348, 
5 typically 78-323 (or truncated variants (78-313, 314, 31 5, 316 . . . 322, or 
shorter variants that are catalytically active, following cleavage)) and 
catalytically active fragments thereof. The single chain form contains amino 
acids 78-348, typically 78-323 (or truncated variants (78-313, 314, 315, 316 . 
• 322, or shorter variants that are catalytically active, following cleavage)', and 
10 catalytically active fragments thereof . 

MTSPs are expressed or are activated in certain tumor or cancer cells 
such as lung, prostate, colon and breast, ovarian, pancreatic, lung in other 
tumors. MTSP25 is of interest because it is expressed or is active in tumor 

cells. In particular, it is shown herein, that MTSP25 is, for example, expressed 
in breast, colon, uterine, ovarian, kidney, prostate, testicular cancer tissue and in 
cell lines as well as in certain normal cells and tissues (see e.g., EXAMPLES for 
tissue-specific expression profile).. It may also be expressed in lung, stomach 
prostate and in other tumors. The expression of the this protein can be used to 
monitor cancer and cancer therapy. The level of activated MTSP25 can be 
diagnostic of prostate, uterine, lung esophagus, breast or colon cancer or 
leukemia or other cancer. The expression and/or activation of MTSP25 on or in 
the vicinity of a cell or in a bodily fluid in a subject can be a marker for breast, 
prostate, lung, colon, uterine, kidney, testicular and other cancers. 

Hence the MTSPs provided herein can serve as diagnostic markers for certain 
tumors. In certain embodiments, the MTSP25 polypeptide is detectable in a 
body fluid at a level that differs from its level in body fluids in a subject not 
having a tumor. In other embodiments, the polypeptide is present in a tumor; 
and a substrate or cofactor for the polypeptide is expressed at levels that differ 
from its level of expression in a non-tumor cell in the same type of tissue. In 
other embodiments, the level of expression and/or activity of the MTSP25 
polypeptide in tumor cells differs from its level of expression and/or activity in 
non-tumor cells. | n otne r embodiments, the MTSP25 is present in a tumor; and 
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a substrate or cofactor for the MTSP25 is expressed at levels that differ from its 
level of expression in a non-tumor cell in the same type of tissue. 

Assays for identifying effectors, such as compounds, including small 
molecules, and conditions, such as pH, temperature and ionic strength, that 
5 modulate the activation, expression or activity of MTSP25 are also provided 
herein. In exemplary assays, the effects of test compounds on the ability of a 
protease domain of MTSP25 to proteolytically cleave a known substrate, 
typically a fluorescently, chromogenically or otherwise detectably labeled 
substrate, are assessed. Agents, generally compounds, particularly small 
10 molecules, that modulate the activity of the protease domain are candidate 

compounds for modulating the activity of the MTSP25. The protease domains 
can also be used to produce protease domain-specific antibodies. 

Also provided are methods for screening for compounds that modulate 
the activity of MTSP25. The compounds are identified by contacting them with 
15 the MTSP25 or protease domain thereof and a substrate for the MTSP25. A 
change in the amount of substrate cleaved in the presence of the compounds 
compared to that in the absence of the compound indicates that the compound 
modulates the activity of the MTSP25. Such compounds are selected for further 
analyses or for use to modulate the activity of the MTSP25, such as inhibitors or 
20 agonists. The compounds can also be identified by contacting the substrates 
with a cell that expresses the MTSP25 or the extracellular domain or 
proteolytically active portion thereof.. 

Also provided herein are methods of modulating the activity of the 
MTSP25 and screening for compounds that modulate, including inhibit, 
25 antagonize, agonize or otherwise alter the activity of the MTSP25. Of particular 
interest is the extracellular domain of MTSP25 that includes the proteolytic 
(catalytic) portion of the protein. 

Additionally provided herein are antibodies that specifically bind to 
activated two chain (or single chain) forms of the full-length MTSP25, antibodies 
30 that specifically bind to one or both of the single-chain or two-chain form, but do 
not bind or bind with at least 2, 5, or 10-fold less affinity, to the full-length 
zymogen form of an MTSP25, cells, combinations, kits and articles of 
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manufacture that contain the antibodies. Antibodies that specifically bind (i.e. 
bind with at least 2, 5 or 10-fold greater affinity compared to another protein) to 
the MTSP25, particularly those that specifically bind to an activated form one or 
both of the single-chain or two-chain forms of the protease domain or full-length 
5 two-chain form, but not to the full-length zymogen form of an MTSP25. 

Neutralizing antibodies that inhibit a biological activity, particularly 
protease activity, are also provided. The neutralizing antibodies are typically 
selected to specifically bind to the single-chain or two-chain forms of the 
protease domain or to a full-length two-chain form, but not to the full-length 
10 zymogen form of an MTSP25. 

Further provided herein are prognostic, diagnostic, therapeutic screening 
methods using MTSP25 and the nucleic acids encoding MTSP25. In particular, 
the prognostic, diagnostic and therapeutic screening methods are used for 
preventing, treating, or for finding agents useful in preventing or treating, tumors 
1 5 or cancers such as, but are not limited to, lung carcinoma and other cancers, 
breast cancer, prostate cancer, colon adenocarcinoma and other cancers and 
ovarian carcinoma and other ovarian cancers. 

Also provided herein are modulators of the activity of MTSP25, especially 
the modulators obtained according to the screening methods provide herein. 
20 Such modulators can have use in treating cancerous conditions. 

Methods of diagnosing a disease or disorder characterized by detecting an 
aberrant level of an MTSP25 in a subject is provided. The method can be 
practiced by measuring the level of the DNA, RNA, protein or functional activity 
of the MTSP25. An increase or decrease in the level of the DNA, RNA, protein 
25 or functional activity of the MTSP, relative to the level of the DNA, RNA, protein 
or functional activity found in an analogous sample not having the disease or 
disorder (or other suitable control) is indicative of the presence of the disease or 
disorder in the subject. 

Also provided are methods of identifying a compound that binds to the 
30 single-chain and/or two-chain form of MTSP25, by contacting a test compound 
with both forms; determining to which form the compound binds; and if it binds 
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to a form of MTSP25, further determining whether the compound has at least 
one of the following properties: 

(i) inhibits activation of the single-chain zymogen form of MTSP25; 

(ii) inhibits activity of the two-chain or single-chain form; and 
5 (iii) inhibits dimerization of the protein. 

The forms can be full length or truncated forms, including but not limited to, the 
protease domain resulting from cleavage at the activation cleavage site (between 
amino acids R 77 and i 78 SEQ ID No. 17); or from expression of the protease 
domain or catalytically active portions thereof. 

10 Pharmaceutical compositions containing the protease domain and/or full- 

length or other domain of an MTSP25 polypeptide are provided herein in a 
pharmaceutical^ acceptable carrier or excipient are provided herein. 

Also provided are articles of manufacture that contain MTSP25 
polypeptide and protease domains of MTSP25 in single chain forms or activated 

15 forms. The articles contain a) packaging material; b) the polypeptide (or 

encoding nucleic acid), particularly the single chain protease domain thereof; and 
c) a label indicating that the article is for use in assays for identifying modulators 
of the activities of an MTSP25 polypeptide and are provided herein. 

Conjugates containing a) an MTSP25 polypeptide or protease domain in a 

20 single or two chain form; and b) a targeting agent linked to the MTSP directly or 
via a linker, wherein the agent facilitates: i) affinity isolation or purification of the 
conjugate; ii) attachment of the conjugate to a surface; iii) detection of the 
conjugate; or iv) targeted delivery to a selected tissue or cell, are provided 
herein. The conjugate can contain a plurality of agents linked thereto. The 

25 conjugate can be a chemical conjugate; and it can be a fusion protein. The 
targeting agent can be a protein or peptide fragment. The protein or peptide 
fragment can include a protein binding sequence, a nucleic acid binding 
sequence, a lipid binding sequence, a polysaccharide binding sequence, or a 
metal binding sequence. 

30 Combinations, kits and articles of manufacture containing the MTSP25 

polypeptides, domains thereof, or encoding nucleic acids are also provided 
herein^ For example, combinations are provided herein. The combination can 
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include: a) an inhibitor of the activity of an MTSP25; and b) an anti-cancer 
treatment or agent. The MTSP inhibitor and the anti-cancer agent can be 
formulated in a single pharmaceutical composition or each is formulated in a 
separate pharmaceutical composition. The MTSP25 inhibitor can be an antibody 
5 or a fragment or binding portion thereof made against the MTSP25, such as an 
antibody that specifically binds to the protease domain, an inhibitor of MTSP25 
production, or an inhibitor of MTSP25 membrane-localization or an inhibitor of 
MTSP25 activation. Other MTSP25 inhibitors include, but are not limited to, an 
antisense nucleic acid or double-stranded RNA (dsRNA), such as RNAi, encoding 
10 the MTSP25, particularly a portion of the protease domain; a nucleic acid 

encoding at least a portion of a gene encoding the MTSP25 with a heterologous 
nucleotide sequence inserted therein such that the heterologous sequence 
inactivates the biological activity of MTSP25 or the gene encoding it. For 
example, the portion of the gene encoding the MTSP25 can flank the 
15 heterologous sequence to promote homologous recombination with a genomic 
gene encoding the MTSP25. 

Also provided are methods for treating or preventing a tumor or cancer in 
a mammal by administering to a mammal an effective amount of an inhibitor of 
an MTSP25, whereby the tumor or cancer is treated or prevented. The MTSP25 
20 inhibitor used in the treatment or for prophylaxis is administered with a 

pharmaceutically acceptable carrier or excipient. The mammal treated can be a 
human. The treatment or prevention method canvadditionally include 
administering an anti-cancer treatment or agent simultaneously with, subsequent 
to, or before administration of the MTSP25 inhibitor. 
25 Also provided are transgenic non-human animals bearing inactivated 

genes encoding the MTSP and bearing the genes encoding the MTSP25 under 
non-native promotor control are provided. Such animals are useful in animal 
models of tumor initiation, growth and/or progression models. Transgenic non- 
human animals containing heterolgous nucleic acid MTSP25 under native, non- 
30 native promotor control or on an exogenous element, such as a plasmid or 
artificial chromosome, are additionally provided herein. In particular, 
recombinant non-human animals are provided herein, where the gene of an 
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MTSP25 is under control of a promoter that is not the native promoter of the 
gene or that is not the native promoter of the gene in the non-human animal or 
where the nucleic acid encoding the MTSP25 is heterologous to the non-human 
animal and the promoter is the native or a non-native promoter or the MTSP25 is 
5 on an extrachromosomal element, such as a plasmid or artificial chromosome. 
Recombinant and transgenic animals can be produced by homologous 
recombination and non-homologous recombination methods. 

Methods of gene therapy are provided. Such methods can be effected by 
administering in vivo or ex vivo an inactivating form of the MTSP25 or by 
10 administering an MTSP-encoding nucleic acid molecule are also provided. 

Also provided are methods of treatment of tumors by administering a 
prodrug that is activated by MTSP25 that is expressed or active in tumor cells, 
particularly those in which its functional activity in tumor cells is greater than in 
non-tumor cells. The prodrug is administered and, upon administration, active 
15 MTSP25 expressed on cells cleaves the prodrug and releases active drug in the 
vicinity of the tumor cells. The active anti-cancer drug accumulates in the 
vicinity of the tumor. This is particularly useful in instances in which MTSP25 is 
expressed or active in greater quantity, higher level or predominantly in tumor 
cells compared to other cells. 
20 Also provided are methods of diagnosing the presence of a pre-malignant 

lesion, a malignancy, or other pathologic condition in a subject, by obtaining a 
biological sample from the subject; and exposing it to a detectable agent that 
binds to a two-chain and/or single-chain form of MTSP25, where the 
pathological condition is characterized by the presence or absence of the two- 
25 chain and/or single-chain form. 

Methods of inhibiting tumor invasion or metastasis or treating a malignant 
or pre-malignant condition by administering an agent that inhibits activation of 
the zymogen form of MTSP25 or an activity of the activated form are provided. 
The conditions include, but are not limited to, a condition, such as a tumor, of 
30 the breast, cervix, prostate, lung, ovary or colon. 
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DET AILED DESCRIPTION 
A. DEFINITIONS 

Unless defined otherwise, all technical and scientific terms used herein 
have the same meaning as is commonly understood by one of skill in the art to 
5 which the invention(s) belong. All patents, patent applications, published 

applications and publications, Genbank sequences, websites and other published 
materials referred to throughout the entire disclosure herein, unless noted 
otherwise, are incorporated by reference in their entirety. In the event that there 
are a plurality of definitions for terms herein, those in this section prevail. 

10 Where reference is made to a URL or other such identifier or address, it 

understood that such identifiers can change and particular information on the 
internet can come and go, but equivalent information can be found by searching 
the internet. Reference thereto evidences the availability and public 
dissemination of such information. 

15 As used herein, the abbreviations for any protective groups, amino acids 

and other compounds, are, unless indicated otherwise, in accord with their 
common usage, recognized abbreviations, or the IUPAC-IUB Commission on 
Biochemical Nomenclature (see, (1972) Biochem. 7 7:942-944). 

As used herein, serine protease refers to a diverse family of proteases 

20 wherein a serine residue is involved in the hydrolysis of proteins or peptides. 

The serine residue can be part of the catalytic triad mechanism, which includes a 
serine, a histidine and an aspartic acid in the catalysis, or be part of the 
hydroxyl/e-amine or hydroxy I /cr-a mine catalytic dyad mechanism, which involves 
a serine and a lysine in the catalysis. Of particular interest are SPs of 

25 mammalian, including human, origin. Those of skill in this art recognize that, in 
general, single amino acid substitutions in non-essential regions of a polypeptide 
do not substantially alter biological activity (see, e.g., Watson et al. (1987) 
Molecular. Biology of the Gene, 4th Edition, The Benjamin/Cummings Pub. co., 
p. 224). 

30 As used herein, "transmembrane serine protease (MTSP)" refers to. a 

family of transmembrane serine proteases that share common structural features 
as described herein {see, also Hooper et aL (2001) J. Biol. Chem. 276:857-860). 
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Thus, reference, for example, to "MTSP" encompasses all proteins encoded by 
the MTSP gene family, including but are not limited to: MTSP3, MTSP4, 
MTSP6, MTSP7, MTSP9, MTSP10, MTSP10 or an equivalent molecule obtained 
from any other source or that has been prepared synthetically or that exhibits 
5 the same activity. Other MTSPs include, but are not limited to, corin, 

enteropeptidase, human airway trypsin-like protease (HAT), MTSP1, TMPRSS2 
and TMPRSS4. Sequences of encoding nucleic acid molecules and the encoded 
amino acid sequences of exemplary MTSPs and/or domains thereof are set forth, 
for example in U.S. application Serial No. 09/776,191 (SEQ ID Nos. 1-12, 49, 
10 50 and 61-72 therein, published as International PCT application No. WO 

01/57194). The term also encompass MTSPs with amino acid substitutions that 
do not substantially alter activity of each member and also encompasses splice 
variants thereof. Suitable substitutions, including, although not necessarily, 
conservative substitutions of amino acids, are known to those of skill in this art 
1 5 and can be made without eliminating the biological activity, such as the catalytic 
activity, of the resulting molecule. 

As used herein, Type I MTSP refers to transmembrane proteins made with 
an N-terminal signal peptide that is cleaved so that the new N-terminus is on the 
extracytoplasmic side of the membrane. The original N-terminus likely stays on 
20 the cytoplasmic side, and cleavage occurs on the other side of the membrane. 
These proteins are anchored through the C-terminus. 

As used herein, Type If MTSP refers to transmembrane proteins that are 
synthesized with N-terminal or internal signal peptides that are not cleaved and 
that serve as a membrane anchor. 
25 As used herein an MTSP25, whenever referenced herein, is a type I 

transmembrane protein, and includes at least one or all of or any combination of: 

a polypeptide encoded by the sequence of nucleotides set forth in 
SEQ ID No. 5 or SEQ ID No. 15 or by a sequence of nucleotides that includes 
nucleotides that encode the sequence of amino acids set forth in SEQ ID No. 6 
30 or SEQ ID No. 16; 
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a polypeptide encoded by a sequence of nucleotides that 
hybridizes under conditions of low, moderate or high stringency to the sequence 
of nucleotides set forth in is set forth as SEQ ID No. 5 or SEQ ID No. 1 5; 

a polypeptide that includes the sequence of amino acids set forth 
5 in SEQ ID No. 6 or SEQ ID No. 1 6 or a catalytically active portion thereof; 

a polypeptide that includes a sequence of amino acids having at 
least about 60%, 70%, 75%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 
88%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% 
sequence identity with the sequence of amino acids set forth as amino acids 1- 
10 237 in SEQ ID No. 6 or as amino acids 78-313, 78-314, 78-315 ... or 78-323 
in SEQ ID No. 1 6, where amino acid 127 is optionally Thr; and/or 

a polypeptide encoded by a splice variant of the MTSP25 that 
includes the sequence of amino acids set forth in SEQ ID No. 6 or SEQ ID No. 
16. Also included are MTSP25s as described throughout the disclosure herein 
15 ,n particular, MTSP25 polypeptides, with a protease domain 

encompassing amino acids 1-237 as indicated in SEQ ID Nos. 5 and 6, are 
provided. The polypeptide is a single or two chain polypeptide. Smaller 
portions thereof that retain protease activity are also provided. The protease 
domains from MTSPs vary in size and constitution, including insertions and 
20 deletions in surface loops. They retain conserved structure, including at least 
one of the active site triad, primary specificity pocket, oxyanion hole and/or 
other features of serine protease domains of proteases. 

The protease domains of MTSPs vary in size and constitution, including 
insertions and deletions in surface loops. They retain conserved structure, 
25 including at least one of the active site triad, primary specificity pocket, 

oxyanion hole and/or other features of serine protease domains of proteases. 
Thus, for purposes herein, the protease domain is a portion of an MTSP, as 
defined herein, and is homologous to a domain of other MTSP. Thus, for 
purposes herein, the protease domain is a portion of an MTSP, as defined herein, 
30 and is homologous to a domain of other MTSPs, such as corin, enteropeptidase, 
human airway trypsin-like protease (HAT), MTSP1, TMPRSS2, and TMPRSS4, 
which have been previously identified; it was not recognized, however, that an 
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isolated single chain form of the protease domain could function proteolyticaliy in 
in vitro assays. As with the larger class of enzymes of the chymotrypsin (S1) 
fold (see, e.g., Internet accessible MEROPS data base), the MTSP protease 
domains share a high degree of amino acid sequence identity. The His, Asp and 
5 Ser residues necessary for activity are present in conserved motifs. The 

activation/cleavage site, whose cleavage creates the N-terminus of the protease 
domain in the two-chain forms is located in a conserved motif and readily can be 
identified. The activation site, which results in the N-terminus of the second 
chain in the two chain form is located in a conserved motif and readily can be 
10 identified. In the exemplified MTSP25, it is between residues R 77 and l 78 of SEQ 
ID No. 16. 

The MTSP25 can be from any animal, particularly a mammal, and 
includes but are not limited to, primates, including humans, rodents, fowl, 
ruminants and other animals. The full-length zymogen or two-chain activated 

15 form is contemplated or any domain thereof, including the protease domain, 
which can be a two-chain activated form, or a single chain form. 

As used herein, a "protease domain of an MTSP" refers to an extracellular 
protease domain of an MTSP that exhibits proteolytic activity and shares 
homology and structural features with the chymotrypsin/trypsin family protease 

20 domains. Hence it is at least the minimal portion of the domain that exhibits 
proteolytic activity as assessed by standard in vitro assays. Contemplated 
herein are such protease domains and catalytically active portions thereof. Also 
provided are truncated forms of the protease domain that include the smallest 
fragment thereof that acts catalytically as a single chain form. 

25 By active form is meant a form active in vivo and/or in vitro. As 

described herein, the protease domain also can exist as a two-chain form. It is 
shown herein that, at least in vitro, the single chain forms of the SPs and the 
catalytic domains or proteolyticaliy active portions thereof {typically C-terminal 
truncations) exhibit protease activity. Hence provided herein are isolated single 

30 chain forms of the protease domains of SPs and their use in in vitro drug 

screening assays for identification of agents that modulate the activity thereof. 
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As used herein, the catalytically active domain of an MTSP refers to the 
protease domain. Reference to the protease domain of an MTSP generally refers 
to the single chain form of the protein. If the two-chain form or both forms is 
intended, it is so-specified. The zymogen form of each protein is a single chain, 
5 which is converted to the active two chain form by activation cleavage. 

A protease domain of an MTSP25, whenever referenced herein, includes 
at least one or all of or any combination of or a catalytically active portion of: 

a) a single chain or two chain polypeptide that includes the 
sequence of amino acids set forth as residues 1-237 in SEQ ID No. 6, residues 

10 78-323 (or truncated variants (78-313, 314, 315, 316 . . , 322, or shorter 
variants that are catalytically active, following cleavage)) in SEQ ID No. 16 or 
catalytically active portions thereof; 

b) a two chain polypeptide that includes the sequence of amino 
acids that includes at least amino acids 64-191 as set forth in SEQ ID No. 16 

15 and can include amino acids 17-323 (or truncated variants (78-313, 314, 315, 
316... 322, or shorter variants that are catalytically active, following 
cleavage), and truncated forms terminating with any residue between 17 and 64 
(i.e., residue 18, 19, 20, 21, 22, 23 . . . 60, 61, 62, 63) and catalytically active 
portions of any of these polypeptides; 

20 c ) a polypeptide that is a single chain or two chain polypeptide 

that includes the sequence of amino acids set forth as residues 1-237 in SEQ ID 
No. 6 or 78-323 (or truncated variants (78-313, 314, 31 5, 31 6 . 322, or 
shorter variants that are catalytically active, following cleavage) in SEQ ID No. 
1 6 or a catalytically active portion thereof up to but not including the full-length 

25 protein, typically starting at about residue 17; 

d) a polypeptide that includes a sequence of amino acids having at 
least about 60%, 70%, 75%, 80%, 81 %, 82%, 83%, 84%, 85%, 86%, 87%, 
88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% 
sequence identity with the sequence of amino acids of any of a), b) or c); 

30 e ) a protease domain of a polypeptide encoded by a splice variant 

of a sequence of nucleotides that encodes an MTSP25 of any of a)-d); and/or 
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f) a polypeptide encoded by a sequence of nucleotides that 
hybridizes under conditions of low, moderate or high stringency to the sequence 
of nucleotides encoding the polypeptides of any of a)-e) where the sequence of 
nucleotides hybridizes along at least about 70%, 80% or 90% of its full-length 
5 under such conditions. 

As used herein, activation cleavage refers to the cleavage of the protease 
at the N-terminus of the protease domain (generally between an R and I in the 
full-length protein, which includes amino acid residue 1 of SEQ ID No. 6, residue 
78-323 (or truncated variants (78-313, 314, 315, 316 . . . 322, or shorter 

10 variants that are catalyticaliy active, following cleavage) of SEQ ID No. 16). By 
virtue of the Cys-Cys pairing between a Cys outside the protease domain and a 
Cys in the protease domain (in this instance Cys 19l SEQ ID No. 16, upon 
cleavage the resulting polypeptide has two chains ("A" chain and the "B" chain, 
which is the protease domain). Cleavage can be effected by another protease or 

15 autocatalytically. 

As used herein, a two-chain form of the protease domain refers to a two- 
chain form that is formed from the two-chain form of the protease in which the 
Cys pairing between, in this instance, a Cys outside the protease domain and 
Cys 573 (SEQ ID No. 23), which links the protease domain to the remainder of the 

20 polypeptide, the "A" chain. A two chain protease domain form refers to any 
form in which the "remainder of the polypeptide", Le. t "A" chain, is shortened 
and includes from at the Cys outside the protease domain. For example a two 
chain form of an MTSP25 includes from Cys 29a up to and including Cys 573 of SEQ 
ID No. 23 where the A chain includes Cys 296 to R 462 and the B chain includes l 463 

25 to at least Cys 573 . 

MTSPs of interest include those that are activated and/or expressed in 
tumor cells different, typically higher, from those in non-tumor cells; and those 
from cells in which substrates therefor differ in tumor cells from non-tumor cells 
or differ with respect to the substrates, co-factors or receptors, or otherwise 
30 alter the activity or specificity of the MTSP. 
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As used herein, a human protein is one encoded by nucleic acid, such as 
DIMA, present in the genome of a human, including all allelic variants and 
conservative variations as long as they are not variants found in other mammals. 
As used herein, a "nucleic acid encoding a protease domain or 
5 catalytically active portion of a SP" shall be construed as referring to a nucleic 
acid encoding only the recited single chain protease domain or active portion 
thereof, and not the other contiguous portions of the SP as a continuous 
sequence. 

As used herein, catalytic activity refers to the activity of the SP as a 
10 serine protease. Function of the SP refers to its function in tumor biology, 
including promotion of or involvement in initiation, growth or progression of 
tumors, and also roles in signal transduction. Catalytic activity refers to the 
activity of the SP as a protease as assessed in in vitro proteolytic assays that 
detect proteolysis of a selected substrate. 
15 As use d herein, a CUB domain is a motif that mediates protein-protein 

interactions in complement components C1r/C1s and has also been identified in 
various proteins involved in developmental processes. 

As used herein, "LDLR" refers to a low density lipoprotein receptor 
domain, which mediates binding to an LDL receptor. 
20 As used herein, a zymogen is an inactive precursor of a proteolytic 

enzyme. Such precursors are generally larger, although not necessarily larger 
than the active form. With reference to serine proteases, zymogens are 
converted to active enzymes by specific cleavage, including catalytic and 
autocatalytic cleavage, or by binding of an activating co-factor, which generates 
25 an active enzyme. A zymogen, thus, is an enzymatically inactive protein that is 
converted to a proteolytic enzyme by the action of an activator. 

As used herein, "disease or disorder" refers to a pathological condition in 
an organism resulting from, e.g., infection or genetic defect, and characterized 
by identifiable symptoms. 

30 As used herein, neoplasm (neoplasia) refers to abnormal new growth, and 

thus means the same as tumor, which can be benign or malignant. Unlike 
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hyperplasia, neoplastic proliferation persists even in the absence of the or\g\na\ 
stimulus. 

As used herein, neoplastic disease refers to any disorder involving cancer, 
including tumor development, growth, metastasis and progression. 
5 As used herein, cancer refers to a general term for diseases caused by 

any type of malignant tumor. 

As used herein, malignant, as applies to tumors, refers to primary tumors 
that have the capacity of metastasis with loss of growth control and positional 
control. 

10 As used herein, an anti-cancer agent (used interchangeably with "anti- 

tumor or antineoplastic agent") refers to any agents used in the anti-cancer 
treatment. These include any agents, when used alone or in combination with 
other compounds, that can alleviate, reduce, ameliorate, prevent, or place or 
maintain in a state of remission of clinical symptoms or diagnostic markers 

1 5 associated with neoplastic disease, tumor and cancer, and can be used in 
methods, combinations and compositions provided herein. Non-limiting 
examples of anti-neoplastic agents include anti-angiogenic agents, alkylating 
agents, antimetabolites, certain natural products, platinum coordination 
complexes, anthracenediones, substituted ureas, methylhydrazine derivatives, 
20 adrenocortical suppressants, certain hormones, antagonists and anti-cancer 
polysaccharides. 

As used herein, a splice variant refers to a variant produced by differential 
processing of a primary transcript of genomic nucleic acid, such as DNA, that 
results in more than one type of mRNA. Splice variants of SPs are provided 
25 herein. 

As used herein, angiogenesis is intended to broadly encompass the 
totality of processes directly or indirectly involved in the establishment and 
maintenance of new vasculature (neovascularization), including, but not limited 
to, neovascularization associated with tumors. 
30 As used herein, anti-angiogenic treatment or agent refers to any 

therapeutic regimen and compound, when used alone or in combination with 
other treatment or compounds, that can alleviate, reduce, ameliorate, prevent, or 
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place or maintain in a state of remission of clinical symptoms or diagnostic 
markers associated with uridesired and/or uncontrolled angiogenesis. Thus, for 
purposes herein an anti-angiogenic agent refers to an agent that inhibits the 
establishment or maintenance of vasculature. Such agents include, but are not 
5 limited to, anti-tumor agents, and agents for treatments of other disorders 
associated with undesirable angiogenesis, such as diabetic retinopathies, 
restenosis, hyperproliferative disorders and others. 

As used herein, non-anti-angiogenic anti-tumor agents refer to anti-tumor 
agents that do not act primarily by inhibiting angiogenesis. 
10 As used herein, pro-angiogenic agents are agents that promote the 

establishment or maintenance of the vasculature. Such agents include agents 
for treating cardiovascular disorders, including heart attacks and strokes. 

As used herein, undesired and/or uncontrolled angiogenesis refers to 
pathological angiogenesis wherein the influence of angiogenesis stimulators 
15 outweighs the influence of angiogenesis inhibitors. As used herein, deficient 

angiogenesis refers to pathological angiogenesis associated with disorders where 
there is a defect in normal angiogenesis resulting in aberrant angiogenesis or an 
absence or substantial reduction in angiogenesis. 

As used herein, the protease domain of an SP protein refers to the 
20 protease domain of an SP that exhibits proteolytic activity. Hence it is at least 
the minimal portion of the protein that exhibits proteolytic activity as assessed 
by standard assays in vitro. It refers, herein, to a single chain form and also the 
two chain activated form (where the two chain form is intended it will be so- 
noted). Exemplary protease domains include at least a sufficient portion of 
25 sequences of amino acids set forth in SEQ ID No. 6 (amino acids 1-230, 
encoded by nucleotides in SEQ ID No. 5) to exhibit protease activity. 

Also contemplated are nucleic acid molecules that encode a polypeptide 
that has proteolytic activity in an in vitro proteolysis assay and that have at least 
60%, 70%, 75%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 
30 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence 

identity with the full-length or a protease domain of an MTSP25 polypeptide or 
other domain thereof, or that hybridize along their fuiMength or along at least 
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about 70%, 80% or 90% of its full-length to a nucleic acids that encode a 
protease domain or other domain, particularly under conditions of moderate, 
generally high, stringency. 

For the protease domains, residues at the N-terminus can be critical for 
5 activity. It is shown herein that the protease domain of the single chain form of 
the MTSP25 protease is catalytically active. Hence the protease domain 
generally requires the N-terminal amino acids thereof for activity; the C-terminus 
portion can be truncated. The amount that can be removed can be determined 
empirically by testing the polypeptide for protease activity in an in vitro assay 
10 that assesses catalytic cleavage. 

Hence smaller portions of the protease domains, particularly the single 
chain domains, thereof that retain protease activity are contemplated. Such 
smaller versions generally are C-terminal truncated versions of the protease 
domains. Such domains exhibit conserved structure, including at least one 
15 structural feature, such as the active site triad, primary specificity pocket, 

oxyanion hole and/or other features of serine protease domains of proteases. 
Thus, for purposes herein, the protease domain is a single chain portion of an 
MTSP25, as defined herein, but is homologous in its structural features and 
retention of sequence of similarity or homology the protease domain of 
20 chymotrypsin or trypsin. The polypeptide exhibits proteolytic activity as a single 
chain. 

As used herein, "homologous" means about greater than 25% nucleic 
acid sequence identity, such as 25% 40%, 60%, 70%, 80%, 90% or 95%. If 
necessary the percentage homology will be specified. The terms "homology" 

25 and "identity" are often used interchangeably. In general, sequences are aligned 
so that the highest order match is obtained (see, e.g.: Computational Molecular 
Biology, Lesk, A.M., ed., Oxford University Press, New York, 1988; 
Biocomputing: Informatics and Genome Projects, Smith, D.W., ed., Academic 
Press, New York, 1993; Computer Analysis of Sequence Data, Part I, Griffin, 

30 A.M., and Griffin, H.G., eds., Humana Press, New Jersey, 1994; Sequence 
Analysis in Molecular Biology, von Heinje, G., Academic Press, 1987; and 
Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds., M Stockton 
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10 



Press, New York, 1991; Carillo et al. (1988) SI AM J Applied Math 4S:1073). 
By sequence identity, the number of conserved amino acids are determined by 
standard alignment algorithms programs, and are used with default gap penalties 
established by each supplier. Substantially homologous nucleic acid molecules 
would hybridize typically at moderate stringency or at high stringency all along 
the length of the nucleic acid or along at least about 70%, 80% or 90% of the 
full-length nucleic acid molecule of interest. Also contemplated are nucleic acid 
molecules that contain degenerate codons in place of codons in the hybridizing 
nucleic acid molecule. 

Whether any two nucleic acid molecules have nucleotide sequences that 
are at least, for example, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% 
"identical" can be determined using known computer algorithms such as the 
"FAST A" program, using for example, the default parameters as in Pearson et 
al. (1988) Proc. Natl. Acad. Sci. USA S5:2444 (other programs include the GCG 
15 program package (Devereux, J., et al.. Nucleic Acids Research 12(l):387 

(1984)), BLASTP, BLASTN, FAST A (Atschul, S.F., et al., J Molec Biol 2 75:403 
(1990); Guide to Huge Computers, Martin J. Bishop, ed.. Academic Press, San 
Diego, 1994, and Carillo era/. (1988) SI AM J Applied Math 48AQ13). For 
example, the BLAST function of the National Center for Biotechnology 
20 Information database can be used to determine identity. Other commercially or 
publicly available programs include, DNAStar "MegAlign" program (Madison, Wl) 
and the University of Wisconsin Genetics Computer Group (UWG) "Gap" 
program (Madison Wl)). Percent homology or identity of proteins and/or nucleic 
acid molecules can be determined, for example, by comparing sequence 
information using a GAP computer program (e.g., Needleman et al. (1970) J. 
Mol. Biol. 48:443, as revised by Smith and Waterman ((1 981 ) Adv. Appl. Math. 
2:482). Briefly, the GAP program defines similarity as the number of aligned 
symbols (i.e., nucleotides or amino acids) which are similar, divided by the total 
number of symbols in the shorter of the two sequences. Default parameters for 
the GAP program can include: (1) a unary comparison matrix (containing a value 
of 1 for identities and 0 for non-identities) and the weighted comparison matrix 
of Gribskov et al. (1986) Nucl. Acids Res. 14:6745, as described by Schwartz 



25 



30 
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and Dayhoff, eds., ATLAS OF PROTEIN SEQUENCE AND STRUCTURE, National 
Biomedical Research Foundation, pp. 353-358 (1979); (2) a penalty of 3.0 for 
each gap and an additional 0.10 penalty for each symbol in each gap; and (3) no 
penalty for end gaps. Therefore, as used herein, the term "identity" represents a 
5 comparison between a test and a reference polypeptide or polynucleotide. 

As used herein, the term "at least 90% identical to" refers to percent 
identities from 90 to 99.99 relative to the reference polypeptides, Identity at a 
level of 90% or more is indicative of the fact that, assuming for exemplification 
purposes a test and reference polynucleotide length of 100 amino acids are 
10. compared, no more than 10% (i.e., 10 out of 100) of amino acids in the test 

polypeptide differs from that of the reference polypeptides. Similar comparisons 
can be made between a test and reference polynucleotides. Such differences 
can be represented as point mutations randomly distributed over the entire 
length of an amino acid sequence or they can be clustered in one or more 
15 locations of varying length up to the maximum allowable, e.g. 10/100 amino 

acid difference (approximately 90% identity). Differences are defined as nucleic 
acid or amino acid substitutions, or deletions. At the level of homologies or 
identities above about 85-90%, the result should be independent of the program 
and gap parameters set; such high levels of identity can be assessed readily, 
20 often without relying on software. 

As used herein, primer refers to an oligonucleotide containing two or 
more deoxyribonucleotides or ribonucleotides, typically more than three, from 
which synthesis of a primer extension product can be initiated. Experimental 
conditions conducive to synthesis include the presence of nucleoside 
25 triphosphates and an agent for polymerization and extension, such as DNA 
polymerase, and a suitable buffer, temperature and pH. 

As used herein, animals include any animal, such as, but are not limited 
to, goats, cows, deer, sheep, rodents, pigs and humans. Non-human animals 
exclude humans as the contemplated animal. The SPs provided herein are from 
30 any source, animal, plant, prokaryotic and fungal. Most MTSP25s are of animal 
origin, including mammalian origin. 
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As used herein, genetic therapy involves the transfer of heterologous 
nucleic acid, such as DNA, into certain cells, target cells, of a mammal, 
particularly a human, with a disorder or conditions for which such therapy is 
sought. The nucleic acid, such as DNA, is introduced into the selected target 
5 cells in a manner such that the heterologous nucleic acid, such as DNA, is 
expressed and a therapeutic product encoded thereby is produced. 
Alternatively, the heterologous nucleic acid, such as DNA, can in some manner 
mediate expression of DNA that encodes the therapeutic product, or it can 
encode a product, such as a peptide or RNA that in some manner mediates, 

1 0 directly or indirectly, expression of a therapeutic product. Genetic therapy can 
also be used to deliver nucleic acid encoding a gene product that replaces a 
defective gene or supplements a gene product produced by the mammal or the 
cell in which it is introduced. The introduced nucleic acid can encode a 
therapeutic compound, such as a growth factor inhibitor thereof, or a tumor 

15 necrosis factor or inhibitor thereof, such as a receptor therefor, that is not 
normally produced in the mammalian host or that is not produced in 
therapeutically effective amounts or at a therapeutically useful time. The 
heterologous nucleic acid, such as DNA, encoding the therapeutic product can 
be modified prior to introduction into the cells of the afflicted host in order to 

20 enhance or otherwise alter the product or expression thereof. Genetic therapy 
can also involve delivery of an inhibitor or repressor or other modulator of gene 
expression. 

As used herein, heterologous nucleic acid is nucleic acid that (if DNA 
encodes RNA) and proteins that are not normally produced in vivo by the cell in 

25 which it is expressed or that mediates or encodes mediators that alter expression 
of endogenous nucleic acid, such as DNA, by affecting transcription, translation, 
or other regulatable biochemical processes. Heterologous nucleic acid, such as 
DNA, can also be referred to as foreign nucleic acid, such as DNA. Any nucleic 
acid, such as DNA, that one of skill in the art would recognize or consider as 

30 heterologous or foreign to the cell in which the nucleic acid is expressed is 
herein encompassed by heterologous nucleic acid; heterologous nucleic acid 
includes exogenously added nucleic acid that is also expressed endogenously. 
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Examples of heterologous nucleic acid include, but are not limited to, nucleic 
acid that encodes traceable marker proteins, such as a protein that confers drug 
resistance, nucleic acid that encodes therapeutically effective substances, such 
as anti-cancer agents, enzymes and hormones, and nucleic acid, such as DMA, 
5 that encodes other types of proteins, such as antibodies. Antibodies that are 
encoded by heterologous nucleic acid can be secreted or expressed on the 
surface of the cell in which the heterologous nucleic acid has been introduced. 
Heterologous nucleic acid is generally not endogenous to the cell into which it is 
introduced, but has been obtained from another cell or prepared synthetically. 
10 Generally, although not necessarily, such nucleic acid encodes RNA and proteins 
that are not normally produced by the cell in which it is now expressed. 

As used herein, a therapeutically effective product for gene therapy is a 
product that is encoded by heterologous nucleic acid, typically DNA, that, upon 
introduction of the nucleic acid into a host, a product is expressed that 
1 5 ameliorates or eliminates the symptoms, manifestations of an inherited or 

acquired disease or that cures the disease. Also included are biologically active 
nucleic acid molecules, such as RNAi and antisense. 

As used herein, recitation that a polypeptide consists essentially of the 
protease domain means that the only SP portion of the polypeptide is a protease 
20 domain or a catalytically active portion thereof. The polypeptide can optionally, 
and generally will, include additional non-SP-derived sequences of amino acids. 

As used herein, cancer or tumor treatment or agent refers to any 
therapeutic regimen and/or compound that, when used alone or in combination 
with other treatments or compounds, can alleviate, reduce, ameliorate, prevent, 
25 or place or maintain in a state of remission of clinical symptoms or diagnostic 
markers associated with deficient angiogenesis. 

As used herein, domain refers to a portion of a molecule, e.g., protein or 
the encoding nucleic acid, that is structurally and/or functionally distinct from 
other portions of the molecule. 
30 As used herein, protease refers to an enzyme catalyzing hydrolysis of 

proteins or peptides. It includes the zymogen form and activated forms thereof. 
For clarity, reference to protease refers to all forms, and particular forms will be 
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specifically designated. For purposes herein, the protease domain includes single 
and two chain forms of the protease domain of an SP protein. For MTSP25 the 
protease domain also includes single and two chain forms of the protease 
domain* 

5 As used herein, nucleic acids include DNA, RNA and analogs thereof, 

including peptide nucleic acids (PNA) and mixtures thereof. Nucleic acids can be 
single or double-stranded. When referring to probes or primers, optionally 
labeled, with a detectable label, such as a fluorescent or radiolabel, single- 
stranded molecules are contemplated. Such molecules are typically of a length 
10 such that their target is statistically unique or of low copy number (typically less 
than 5, generally less than 3) for probing or priming a library. Generally a probe 
or primer contains at least 14, 16 or 30 contiguous of sequence complementary 
to or identical a gene of interest. Probes and primers can be 10, 20, 30, 50, 
100 or more nucleic acids long. 
1 5 As used herein, nucleic acid encoding a fragment or portion of an SP 

refers to a nucleic acid encoding only the recited fragment or portion of SP, and 
not the other contiguous portions of the SP. 

As used herein, operative linkage of heterologous nucleic acids to 
regulatory and effector sequences of nucleotides, such as promoters, enhancers, 
20 transcriptional and translat/onal stop sites, and other signal sequences refers to 
the relationship between such nucleic acid, such as DNA, and such sequences of 
nucleotides. Thus, operatively linked or operationally associated refers to the 
functional relationship of nucleic acid, such as DNA, with regulatory and effector 
sequences of nucleotides, such as promoters, enhancers, transcriptional and 
25 translational stop sites, and other signal sequences. For example, operative 
linkage of DNA to a promoter refers to the physical and functional relationship 
between the DNA and the promoter such that the transcription of such DNA is 
initiated from the promoter by an RNA polymerase that specifically recognizes, 
binds to and transcribes the DNA. In order to optimize expression and/or in vitro 
30 transcription, it can be necessary to remove, add or alter 5' untranslated portions 
of the clones to eliminate extra, potential inappropriate alternative translation 
initiation (i.e., start) codons or other sequences that can interfere with or reduce 
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expression, either at the level of transcription or translation. Alternatively, 
consensus ribosome binding sites (see, e.g., Kozak J. Biol. Chem. 265:19867- 
19870 (1991)) can be inserted immediately 5' of the start codon and can 
enhance expression. The desirability of (or need for) such modification can be 
5 empirically determined. 

As used herein, a sequence complementary to at least a portion of an 
RNA, with reference to antisense oligonucleotides, means a sequence having 
sufficient complementarily to be able to hybridize with the RNA, generally under 
moderate or high stringency conditions, forming a stable duplex; in the case of 

10 double-stranded SP antisense nucleic acids, a single strand of the duplex DNA 
(or dsRNA) can thus be tested, or triplex formation can be assayed. The ability 
to hybridize depends on the degree of complementarily and the length of the 
antisense nucleic acid. Generally, the longer the hybridizing nucleic acid, the 
more base mismatches with a SP encoding RNA it can contain and still form a 

15 stable duplex (or triplex, as the case can be). One skilled in the art can ascertain 
a tolerable degree of mismatch by use of standard procedures to determine the 
melting point of the hybridized complex. 

For purposes herein, amino acid substitutions can be made in any of SPs 
and protease domains thereof provided that the resulting protein exhibits 

20 protease activity. Amino acid substitutions contemplated include conservative 
substitutions, such as those set forth in Table 1, which do not eliminate 
proteolytic activity. As described herein, substitutions that alter properties of 
the proteins, such as removal of cleavage sites and other such sites are also 
contemplated; such substitutions are generally non-conservative, but can be 

25 readily effected by those of skill in the art. 

Suitable conservative substitutions of amino acids are known to those of 
skill in this art and can be made generally without altering the biological activity, 
for example enzymatic activity, of the resulting molecule. Those of skill in this 
art recognize that, in general, single amino acid substitutions in non-essential 

30 regions of a polypeptide do not substantially alter biological activity (see, e.g., 
Watson et al. Molecular Biology of the Gene, 4th Edition, 1987, The 
Bejacmin/Cummings Pub. co., p.224). Also included within the definition, is the 
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15 



20 



catalytically active fragment of an SP, particularly a single chain protease 
portion. Conservative amino acid substitutions are made, for example, in 
accordance with those set forth in TABLE 1 as follows: 

TABLE 1 



Original residue 


Conservative substitution 


A 1 „ / A \ 

Ala (A) 


Gly; Ser, Abu 


Arg (R) 


Lys, orn 


Asn (N) 


Gin; His 


Cys <C) 


Ser 


Gin (Q) 


Asn 


Glu (E) 


Asp 


Gly (G) 


Ala; Pro 


His (H) 


Asn; Gin 


He (1) 


Leu; Val; Met; Nle; Nva 


Leu (L) 


He; Val; Met; Nle; Nv 


Lys (K) 


Arg; Gin; Glu 


Met (M) 


Leu; Tyr; lie; NLe Val 


Ornithine 


Lys; Arg 


Phe (F) 


Met; Leu; Tyr 


Ser (S) 


Thr 


Thr (T) 


Ser 


Trp (WJ 


Tyr 


Tyr (Y) 


Trp; Phe 


Val {V) 


lie; Leu; Met; Nle; Nv 



25 Other SUbsl „ ■ — — .~ . w ww. i ucichhh euipii i<L"ctity or if 5 

accord with known conservative substitutions. 

As used herein, Abu is 2-aminobutyric acid; Orn is ornithine. 

As used herein, the amino acids, which occur in the various amino acid 
sequences appearing herein, are identified according to their well-known, three- 
30 letter or one-letter abbreviations. The nucleotides, which occur in the various 

DNA fragments, are designated with the standard single-letter designations used 
routinely in the art. 

As used herein, a probe or primer based on a nucleotide sequence 
disclosed herein, includes at least 10, 14, typically at least 16 contiguous 
35 sequence of nucleotides of SEQ ID No. 5 or SEQ ID No. 15, and probes of at 

least 30, 50 or 100 contiguous sequence of nucleotides of SEQ ID No. 5 or SEQ 
ID No. 15. The length of the probe or primer for unique hybridization is a 
function of the complexity of the genome of interest. 
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As used herein, amelioration of the symptoms of a particular disorder by 
administration of a particular pharmaceutical composition refers to any lessening, 
whether permanent or temporary, lasting or transient that can be attributed to or 
associated with administration of the composition. 
5 As used herein, antisense polynucleotides refer to synthetic sequences of 

nucleotide bases complementary to mRNA or the sense strand of double- 
stranded DNA. Admixture of sense and antisense polynucleotides under 
appropriate conditions leads to the binding of the two molecules, or 
hybridization. When these polynucleotides bind to (hybridize with) mRNA, 
10 inhibition of protein synthesis (translation) occurs. When these polynucleotides 
bind to double-stranded DNA, inhibition of RNA synthesis (transcription) occurs. 
The resulting inhibition of translation and/or transcription leads to an inhibition of 
the synthesis of the protein encoded by the sense strand. Antisense nucleic 
acid molecules typically contain a sufficient number of nucleotides to specifically 
bind to a target nucleic acid, generally at feast 5 contiguous nucleotides, often at 
least 14 or 16 or 30 contiguous nucleotides or modified nucleotides 
complementary to the coding portion of a nucleic acid molecule that encodes a 
gene of interest, for example, nucleic acid encoding a single chain protease 
domain of an SP. Antisense RNA, as well as other oligonucleotides and RNA 
20 moelcules, can include modified bases and ribonucleotide and nucleotide 
analogs. 

As used herein, an array refers to a collection of elements, such as 
antibodies, containing three or more members. An addressable array is one in 
which the members of the array are identifiable, typically by position on a solid 
phase support. Hence, in general the members of the array are immobilized on 
discrete identifiable loci on the surface of a solid phase. 

As used herein, antibody refers to an immunoglobulin, whether natural or 
partially or wholly synthetically produced, including any derivative thereof that 
retains the specific binding ability the antibody. Hence antibody includes any 
protein having a binding domain that is homologous or substantially homologous 
to an immunoglobulin binding domain. Antibodies include members of any 
immunoglobulin claims, including IgG, IgM, IgA, igD and IgE. 
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As used herein, antibody fragment refers to any derivative of an antibody 
that is less than full-length, retaining at least a portion of the full-length 
antibody's specific binding ability. Examples of antibody fragments include, but 
are not limited to, Fab, Fab', F(ab) 2 , single-chain Fvs (scFV), FV, dsFV diabody 
5 and Fd fragments. The fragment can include multiple chains linked together, 
such as by disulfide bridges. An antibody fragment generally contains at least 
about 50 amino acids and typically at least 200 amino acids. 

As used herein, a Fv antibody fragment is composed of one variable 
heavy domain (V H ) and one variable light domain linked by noncovalent 
10 interactions. 

As used herein, a dsFV refers to a Fv with an engineered intermolecular 

y 

disulfide bond, which stabilizes the V H -V L pair. 

As used herein, a F(ab) 2 fragment is an antibody fragment that results 
from digestion of an immunoglobulin with pepsin at pH 4.0-4.5; it can be 
15 recombinants expressed to produce the equivalent fragment. 

As used herein, Fab fragments are antibody fragments that result from 
digestion of an immunoglobulin with papain; they can be recombinantly 
expressed to produce the equivalent fragment. 

As used herein, scFVs refer to antibody fragments that contain a variable 
20 light chain (V L ) and variable heavy chain (V H ) covalently connected by a 
polypeptide linker in any order. The linker is of a length such that the two 
variable domains are bridged without substantial interference. Included linkers 
are {Gly-Ser) n residues with some Glu or Lys residues dispersed throughout to 
increase solubility. 

25 As used herein, humanized antibodies refer to antibodies that are 

modified to include human sequences of amino acids so that administration to a 
human does not provoke an immune response. Methods for preparation of such 
antibodies are known. For example, to produce such antibodies, the encoding 
nucleic acid in the hybridoma or other prokaryotic or eukaryotic cell, such as an 

30 E. coli or a CHO cell, that expresses the monoclonal antibody is altered by 

recombinant nucleic acid techniques to express an antibody in which the amino 
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acid composition of the non-variable region is based on human antibodies. 
Computer programs have been designed to identify such non-variable regions. 

As used herein, diabodies are dimeric scFV; diabodies typically have 
shorter peptide linkers than scFvs, and they generally dimerize. 
5 As used herein, production by recombinant means by using recombinant 

DNA methods means the use of the well known methods of molecular biology 
for expressing proteins encoded by cloned DNA. 

As used herein the term assessing is intended to include 
quantitative and qualitative determination in the sense of obtaining an 
10 absolute value for the activity of an SP, or a domain thereof, present in the 

sample, and also of obtaining an index, ratio, percentage, visual or other value 
indicative of the level of the activity. Assessment can be direct or indirect and 
the chemical species actually detected need not of course be the proteolysis 
product itself but can for example be a derivative thereof or some further 
15 substance. ' 

As used herein, biological activity refers to the in vivo activities of a 
compound or physiological responses that result upon in vivo administration of a 
compound, composition or other mixture. Biological activity, thus, encompasses 
therapeutic effects and pharmaceutical activity of such compounds, 
20 compositions and mixtures. Biological activities can be observed in in vitro 

systems designed to test or use such activities. Thus, for purposes herein the 
biological activity of a luciferase is its oxygenase activity whereby, upon 
oxidation of a substrate, light is produced. 

As used herein, functional activity refers to a polypeptide or portion 
25 thereof that displays one or more activities associated with a full-length protein. 
Functional activities include, but are not limited to, biological activity, catalytic or 
enzymatic activity, antigenicity (ability to bind to or compete with a polypeptide 
for binding to an anti-polypeptide antibody), immunogenicity, ability to form 
multimers, and the ability to specifically bind to a receptor or ligand for the 
30 polypeptide. 

As used herein, a molecule, such as an antibody, that specifically binds to 
a polypeptide typically has a binding affinity (K a ) of at least about 10 8 l/mol, 10 7 
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l/mol, 10 8 l/mol, 10 9 l/mol, 10 10 l/mol or greater and binds to a protein of interest 
generally with at least 10-fold, generally 100-fold or greater, affinity than to 
other proteins. For example, an antibody that specifically binds to the protease 
domain compared to the full-length molecule, such as the zymogen form, binds 
5 with at least about 2-fold, typically 5-fold ro 10-fold higher affinity, to a 

polyeptide that contains only the protease domain than to the zymogen form of 
the full-length. ED DOES THIS MAKE ANY SENSE??? Such specific binding is 
also referred to as selective binding. 

As used herein, a conjugate refers to the compounds provided herein that 
10 include one or more SPs, including an MTSP25, particularly single chain protease 
domains thereof, and one or more targeting agents. These conjugates include 
those produced by recombinant means as fusion proteins, those produced by 
chemical means, such as by chemical coupling, through, for example, coupling 
to sulfhydryl groups, and those produced by any other method whereby at least 
15 one SP, or a domain thereof, is linked, directly or indirectly via linker(s) to a 
targeting agent. 

As used herein, a targeting agent is any moiety, such as a protein or 
effective portion thereof, that provides specific binding of the conjugate to a cell 
surface receptor, which, can internalize the conjugate or SP portion thereof. A 
targeting agent can also be one that promotes or facilitates, for example, affinity 
isolation or purification of the conjugate; attachment of the conjugate to a 
surface; or detection of the conjugate or complexes containing the conjugate. 

As used herein, an antibody conjugate refers to a conjugate in which the 
targeting agent is an antibody. 

25 As used herein, derivative or analog of a molecule refers to a portion 

derived from or a modified version of the molecule. 

As used herein, an effective amount of a compound for treating a 

particular disease is an amount that is sufficient to ameliorate, or in some 

manner reduce the symptoms associated with the disease. Such an amount can 
30 be administered as a single dosage or can be administered according to a 

regimen, whereby it is effective. The amount can cure the disease but, typically, 
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is administered in order to ameliorate the symptoms of the disease. Repeated 
administration can be required to achieve the desired amelioration of symptoms. 

As used herein, equivalent, when referring to two sequences of nucleic 
acids, means that the two sequences in question encode the same sequence of 
5 amino acids or equivalent proteins. When equivalent is used in referring to two 
proteins or peptides, it means that the two proteins or peptides have 
substantially the same amino acid sequence with only amino acid substitutions 
(such as, but not limited to, conservative changes such as those set forth in 
Table 1 above) that do not substantially alter the activity or function of the 
10 protein or peptide. When equivalent refers to a property, the property does not 
need to be present to the same extent [e.g., two peptides can exhibit different 
rates of the same type of enzymatic activity), but the activities are usually 
substantially the same. Complementary, when referring to two nucleotide 
sequences, means that the two sequences of nucleotides are capable of 
15 hybridizing, typically with less than 25%, 15%, 5% or 0% mismatches between 
opposed nucleotides. If necessary, the percentage of complementarity will be 
specified. Typically the two molecules are selected such that they will hybridize 
under conditions of high stringency. 

As used herein, an agent that modulates the activity of a protein or 
20 expression of a gene or nucleic acid either decreases or increases or otherwise 
alters the activity of the protein or, in some manner, up- or down-regulates or 
otherwise alters expression of the nucleic acid in a cell. 

As used herein, inhibitor of the activity of an SP encompasses any 
substance that prohibits or decrease production, post-translational 
25 modification^), maturation, or membrane localization of the SP or any substance 
that interferes with or decreases the proteolytic efficacy of thereof, particularly 
of a single chain form in an in vitro screening assay. 

As used herein, a method for treating or preventing neoplastic disease 
means that any of the symptoms, such as the tumor, metastasis thereof, the 
30 vascularization of the tumors or other parameters by which the disease is 

characterized are reduced, ameliorated, prevented, placed in a state of remission, 
or maintained in a state of remission. It also means that the hallmarks of 
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neoplastic disease and metastasis can be eliminated, reduced or prevented by 
the treatment. Non-limiting examples of the hallmarks include uncontrolled 
degradation of the basement membrane and proximal extracellular matrix, 
migration, division, and organization of the endothelial cells into new functioning 
5 capillaries, and the persistence of such functioning capillaries. 

As used herein, pharmaceutical^ acceptable salts, esters or other 
derivatives of the conjugates include any salts, esters or derivatives that can be 
readily prepared by those of skill in this art using known methods for such 
derivatization and that produce compounds that can be administered to animals 

10 or humans without substantial toxic effects and that either are pharmaceutical^ 
active or are prodrugs. 

As used herein, a prodrug is a compound that, upon in vivo 
administration, is metabolized or otherwise converted to the biologically, 
pharmaceutical^ or therapeutically active form of the compound. To produce a 

15 prodrug, the pharmaceutical^ active compound is modified such that the active 
compound is regenerated by metabolic processes. The prodrug can be designed 
to alter the metabolic stability or the transport characteristics of a drug, to mask 
side effects or toxicity, to improve the flavor of a drug or to alter other 

* 

characteristics or properties of a drug. By virtue of knowledge of 
20 pharmacodynamic processes and drug metabolism in vivo, those of skill in this 
art, once a pharmaceutical^ active compound is known, can design prodrugs of 
the compound (see, e.g., Nogrady (1985) Medicinal Chemistry A Biochemical 
Approach, Oxford University Press, New York, pages 388-392). 

As used herein, a drug identified by the screening methods provided 
25 herein refers to any compound that is a candidate for use as a therapeutic or as 
a lead compound for the design of a therapeutic. Such compounds can be small 
molecules, including small organic molecules, peptides, peptide mimetics, 
antisense molecules or dsRNA, such as RNAi, antibodies, fragments of 
antibodies, recombinant antibodies and other such compounds that can serve as 
30 drug candidates or lead compounds. 

As used herein, a peptidomimetic is a compound that mimics the 
conformation and certain stereochemical features of the biologically active form 
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of a particular peptide. In general, peptidomimetics are designed to mimic 
certain desirable properties of a compound, but not the undesirable properties, 
such as flexibility, that lead to a loss of a biologically active conformation and 
bond breakdown. Peptidomimetics may be prepared from biologically active 
5 compounds by replacing certain groups or bonds that contribute to the 

undesirable properties with bioisosteres. Bioisosteres are known to those of 
skill in the art. For example the methylene bioisostere CH 2 S has been used as an 
amide replacement in enkephalin analogs (see, e.g.. Spatola (1983) pp. 267-357 
in Chemistry and Bioc hemistry of Amino Acids, Peptides, and Proteins . Weistein, 
10 Ed. volume 7, Marcel Dekker, New York). Morphine, which can be administered 
orally, is a compound that is a peptidomimetic of the peptide endorphin. For 
purposes herein, cyclic peptides are included among peptidomimetics. 

As used herein, a promoter region or promoter element refers to a 
segment of DNA or RNA that controls transcription of the DNA or RNA to which 
15 it is operatively linked. The promoter region includes specific sequences that are 
sufficient for RNA polymerase recognition, binding and transcription initiation. 
This portion of the promoter region is referred to as the promoter. In addition, 
the promoter region includes sequences that modulate this recognition, binding 
and transcription initiation activity of RNA polymerase. These sequences can be 
20 c/s acting or can be responsive to trans acting factors. Promoters, depending 
upon the nature of the regulation, can be constitutive or regulated. Exemplary 
promoters contemplated for use in prokaryotes include the bacteriophage T7 and 
T3 promoters. 

As used herein, a receptor refers to a molecule that has an affinity for a 
25 given ligand. Receptors can be naturally-occurring or synthetic molecules. 

Receptors can also be referred to in the art as anti-ligands. As used herein, the 
receptor and anti-ligand are interchangeable. Receptors can be used in their 
unaltered state or as aggregates with other spectes. Receptors can be attached 
to, covalently or noncovalently, or in physical contact with, a binding member, 
30 either directly or indirectly via a specific binding substance or linker. Examples 
of receptors, include, but are not limited to: antibodies, cell membrane receptors 
surface receptors and internalizing receptors, monoclonal antibodies and antisera 
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reactive with specific antigenic determinants [such as on viruses, cells, or other 
materials], drugs, polynucleotides, nucleic acids, peptides, cofactors, lectins, 
sugars, polysaccharides, cells, cellular membranes, and organelles. 

Examples of receptors and applications using such receptors, include but 
5 are not restricted to: 

a) enzymes: specific transport proteins or enzymes essential to survival 
of microorganisms, which could serve as targets for antibiotic [ligand] selection; 

b) antibodies: identification of a ligand-binding site on the antibody 
molecule that combines with the epitope of an antigen of interest can be 

10 investigated; determination of a sequence that mimics an antigenic epitope can 
lead to the development of vaccines of which the immunogen is based on one or 
more of such sequences or lead to the development of related diagnostic agents 
or compounds useful in therapeutic treatments such as for auto-immune diseases 

c) nucleic acids: identification of ligand, such as protein or RNA, binding 

15 sites; 

d) catalytic polypeptides: polymers, including polypeptides, that are 
capable of promoting a chemical reaction involving the conversion of one or 
more reactants to one or more products; such polypeptides generally include a 
binding site specific for at least one reactant or reaction intermediate and an 

20 active functionality proximate to the binding site, in which the functionality is 
capable of chemically modifying the bound reactant (see, e.g., U.S. Patent No. 
5,215,899); 

e) hormone receptors: determination of the ligands that bind with high 
affinity to a receptor is useful in the development of hormone replacement 

25 therapies; for example, identification of ligands that bind to such receptors can 
lead to the development of drugs to control blood pressure; and 

f) opiate receptors: determination of ligands that bind to the opiate 
receptors in the brain is useful in the development of less-addictive replacements 
for morphine and related drugs. 

30 As use d herein, sample refers to anything that contains an analyte for 

which an analyte assay is desired. The sample can be a biological sample, such 
as a biological fluid or a biological tissue. Examples of biological fluids include 
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urine, blood, plasma, serum, saliva, semen, stool, sputum, cerebral spinal fluid, 
tears, mucus, sperm, amniotic fluid or the like. Biological tissues are aggregates 
of cells, usually of a particular kind together with their intercellular substance 
that form one of the structural materials of a human, animal, plant, bacterial, 
5 fungal or viral structure, including connective, epithelium, muscle and nerve 
tissues. Examples of biological tissues also include organs, tumors, lymph 
nodes, arteries and individual cell(s). 

As used herein: stringency of hybridization in determining percentage 
mismatch is as follows: 
10 D high stringency: 0.1 x SSPE, 0.1% SDS, 65 °C 

2) medium stringency: 0.2 x SSPE, 0.1% SDS, 50°C 

3) low stringency: 1.0 x SSPE, 0.1 % SDS, 50°C 

Those of skill in this art know that the washing step selects for stable 
hybrids and also know the ingredients of SSPE (see, e.g., Sambrook, E.F. 
15 Fritsch, T. Maniatis, in: Molecular Cloning, A Laboratory Manual, Cold Spring 
Harbor Laboratory Press (1989), vol. 3, p. B.13, see, also, numerous catalogs 
that describe commonly used laboratory solutions). SSPE is pH 7.4 phosphate- 
buffered 0.18 M NaCI. Further, those of skill in the art recognize that the 
stability of hybrids is determined by T m , which is a function of the sodium ion 
20 concentration and temperature (T m = 81 .5° C-1 6.6(log 10 [Na + ]) + 0.41{%G + C)- 
600/I)), so that the only parameters in the wash conditions critical to hybrid 
stability are sodium ion concentration in the SSPE (or SSC) and temperature. 

It is understood that equivalent stringencies can be achieved using 
alternative buffers, salts and temperatures. By way of example and not 
25 limitation, procedures using conditions of low stringency are as follows (see also 
Shilo and Weinberg, Proc. Natl. Acad. Scl. USA 75:6789-6792 (1981)): Filters 
containing DNA are pretreated for 6 hours at 40°C in a solution containing 35% 
formamide, 5X SSC, 50 mM Tris-HCI (pH 7.5), 5 mM EDTA, 0.1 % PVP, 0.1% 
Ficoll, 1% BSA, and 500 //g/ml denatured salmon sperm DNA (10X SSC is 1.5 
30 M sodium chloride, and 0.15 M sodium citrate, adjusted to a pH of 7). 

Hybridizations are carried out in the same solution with the following 
modifications: 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100pg/ml salmon sperm 
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DNA, 10% (wt/vol) dextran sulfate, and 5-20 X 10 e cpm 32 P-labeled probe is 
used. Filters are incubated in hybridization mixture for 18-20 hours at 40 °C, 
and then washed for 1.5 hours at 55°C in a solution containing 2X SSC, 25 mM 
Tris-HCI (pH 7.4), 5 mM EDTA, and 0.1% SDS. The wash solution is replaced 
5 with fresh solution and incubated an additional 1.5 hours at 60°C. Filters are 
blotted dry and exposed for autoradiography. If necessary, filters are washed for 
a third time at 65-68°C and reexposed to film. Other conditions of low 
stringency which can be used are well known in the art (e.g., as employed for 
cross-species hybridizations). 
1 0 B Y way of example and not way of limitation, procedures using 

conditions of moderate stringency include, for example, but are not limited to, 
procedures using such conditions of moderate stringency are as follows; Filters 
containing DNA are pretreated for 6 hours at 55 °C in a solution containing 6X 
SSC, 5X Denhart's solution, 0.5% SDS and 100/yg/ml denatured salmon sperm 
15 DNA. Hybridizations are carried out in the same solution and 5-20 X 10 6 cpm 

32 P-labeled probe is used. Filters are incubated in hybridization mixture for 18-20 
hours at 55 °C, and then washed twice for 30 minutes at 60 °C in a solution 
containing 1X SSC and 0.1 % SDS. Filters are blotted dry and exposed for 
autoradiography. Other conditions of moderate stringency which can be used 
20 are well-known in the art. Washing of filters is done at 37°C for 1 hour in a 
solution containing 2X SSC, 0.1% SDS. 

By way of example and not way of limitation, procedures using conditions 
of high stringency are as follows: Prehybridization of filters containing DNA is 
carried out for 8 hours to overnight at 65 °C in buffer composed of 6X SSC, 
25 50 mM Tris-HCI (pH 7.5), 1 mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.02% BSA, 
and 500/yg/ml denatured salmon sperm DNA. Filters are hybridized for 48 hours 
at 65 °C in prehybridization mixture containing 100 //g/ml denatured salmon 
sperm DNA and 5-20 X 10 6 cpm of 32 P-labeled probe. Washing of filters is done 
at 37°C for 1 hour in a solution containing 2X SSC, 0.01 % PVP, 0.01 % Ficoll, 
30 and 0.01 % BSA. This is followed by a wash in 0.1X SSC at 50°C for 45 

minutes before autoradiography. Other conditions of high stringency which can 
be used are well known in the art. 
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The term substantially identical or substantially homologous or similar 
varies with the context as understood by those skilled in the relevant art and 
generally means at least 60% or 70%, preferably means at least 80%, 85% or 
more preferably at least 90%, and most preferably at least 95% identity. 
5 As used herein, substantially identical to a product means sufficiently 

similar so that the property of interest is sufficiently unchanged so that the 
substantially identical product can be used in place of the product. 

As used herein, substantially pure means sufficiently homogeneous to 
appear free of readily detectable impurities as determined by standard methods 
10 of analysis, such as thin layer chromatography (TLC), gel electrophoresis and 

high performance liquid chromatography (HPLC), used by those of skill in the art 
to assess such purity, or sufficiently pure such that further purification would 
not detectably alter the physical and chemical properties, such as enzymatic and 
biological activities, of the substance. Methods for purification of the 
1 5 compounds to produce substantially chemically pure compounds are known to 
those of skill in the art. A substantially chemically pure compound can, 
however, be a mixture of stereoisomers or isomers. In such instances, further 
purification might increase the specific activity of the compound. 

As used herein, target cell refers to a cell that expresses an SP in vivo. 
20 As used herein, test substance (or test compound) refers to a chemically 

defined compound (e.g., organic molecules, inorganic molecules, 
organic/inorganic molecules, proteins, peptides, nucleic acids, oligonucleotides, 
lipids, polysaccharides, saccharides, or hybrids among these molecules such as 
glycoproteins, etc.) or mixtures of compounds {e.g., a library of test compounds, 
25 natural extracts or culture supernatants, etc.) whose effect on an SP, particularly 
a single chain form that includes the protease domain or a sufficient portion 
thereof for activity, as determined by an in vitro method, such as the assays 
provided herein, is tested. 

As used herein, the terms a therapeutic agent, therapeutic regimen, 
30 radioprotectant or chemotherapeutic mean conventional drugs and drug 
therapies, including vaccines, which are known to those skilled in the art. 
Radiotherapeutic agents are well known in the art. 
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As used herein, treatment means any manner in which the symptoms of a 
condition, disorder or disease are ameliorated or otherwise beneficially altered. 
Treatment also encompasses any pharmaceutical use of the compositions herein. 
As used herein, vector {or plasmid) refers to discrete elements that are 
5 used to introduce heterologous nucleic acid into cells for either expression or 
replication thereof. The vectors typically remain episomal, but can be designed 
to effect integration of a gene or portion thereof into a chromosome of the 
genome. Also contemplated are vectors that are artificial chromosomes, such as 
yeast artificial chromosomes and mammalian artificial chromosomes. Selection 
10 and use of such vehicles are well known to those of skill in the art. An 

expression vector includes vectors capable of expressing DNA that is operatively 
linked with regulatory sequences, such as promoter regions, that are capable of 
effecting expression of such DNA fragments. Thus, an expression vector refers 
to a recombinant DNA or RNA construct, such as a plasmid, a phage, 
15 recombinant virus or other vector that, upon introduction into an appropriate 
host cell, results in expression of the cloned DNA. Appropriate expression 
vectors are well known to those of skill in the art and include those that are 
replicable in eukaryotic cells and/or prokaryotic cells and those that remain 
episomal or those which integrate into the host cell genome. 
20 As used herein, protein binding sequence refers to a protein or peptide 

sequence or a portion of other macromolecules that is capable of specific binding 
to protein or peptide sequences generally, to a set of protein or peptide 
sequences or to a particular protein or peptide sequence. 

As used herein, metal binding sequence refers to a protein or peptide 
sequence that is capable of specific binding to metal ions generally, to a set of 
metal ions or to a particular metal ion. 

As used herein, a combination refers to any association between two or 
among more items. 



25 
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As used herein, a composition refers to any mixture. It can be a solution, 
a suspension, liquid, powder, a paste, aqueous, non-aqueous or any combination 
thereof. 

As used herein, fluid refers to any composition that can flow. Fluids thus 
5 encompass compositions that are in the form of semi-solids, pastes, solutions, 
aqueous mixtures, gels, lotions, creams and other such compositions. 

As used herein, a cellular extract refers to a preparation or fraction which 
is made from a lysed or disrupted cell. 

As used herein, an agent is said to be randomly selected when the agent 
10 is chosen randomly without considering the specific sequences involved in the 
association of a protein alone or with its associated substrates, binding partners, 
etc. Ah example of randomly selected agents is the use a chemical library or a 
peptide combinatorial library, or a growth broth of an organism or conditioned 
medium. 

15 As used herein, an agent is said to be rationally selected or designed 

when the agent is chosen on a non-random basis which takes into account the 
sequence of the target site and/or its conformation in connection with the 
agent's action. As described in the Examples, there are proposed binding sites 
for serine protease and (catalytic) sites in the protein having SEQ ID NO:2 or 
20 SEQ ID NO:4. Agents can be rationally selected or rationally designed by 
utilizing the peptide sequences that make up these sites. For example, a 
rationally selected peptide agent can be a peptide whose amino acid sequence is 
identical to the ATP or calmodulin binding sites or domains. 

For clarity of disclosure, and not by way of limitation, the detailed 
25 description is divided into the subsections that follow. 

B. MTSP25 polypeptides, mutelns, derivatives and analogs thereof 
MTSPs 

The MTSPs are a family of transmembrane serine proteases that are 
found in mammals and also other species. MTSPs are of interest because they 
30 appear to be expressed and/or activated at different levels in tumor cells from 
normal cells, or have functional activity that is different in tumor cells from 
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normal cells, such as by an alteration in a substrate therefor, or a cofactor or a 
receptor. 

The MTSPs share a number of common structural features including: a 
proteolytic extracellular C-termina! domain; a transmembrane domain, with a 
5 hydrophobic domain near the N-terminus; a short cytoplasmic domain; and a 
variable length stem region that may contain additional modular domains. The 
proteolytic domains share sequence homology including conserved His, Asp, and 
Ser residues necessary for catalytic activity that are present in conserved motifs. 
The MTSPs are normally synthesized as zymogens and can be activated to two- 
10 chain forms by cleavage. It is shown herein that a single chain proteolytic 

domain can function in vitro and, hence is useful in in vitro assays for identifying 
agents that modulate the activity of members of this family. 

For purposes herein, the protease domain of the MTSP does not have to 
result from activation cleavage, which produces a two chain activated product, 
15 but rather includes single chain polypeptides where the N-termini include the 
consensus sequence IVVGG, ilVGG, J VGLL, I ILGG, 4 IVQG or 4 IVNG I IASG 
or other such motif. Such polypeptides, although not the result of activation 
cleavage and not two-chain forms, exhibit proteolytic (catalytic) activity. These 
protease domain polypeptides are used in assays to screen for agents that 
20 modulate the activity of the MTSP25. Nucleic acid encoding the protease 

domain and upstream nucleic acid is set forth in SEQ ID No. 5; and the protease 
domain of MTSP25 is set forth as residues 1-237 in SEQ ID No. 6. 

The MTSP family is a target for therapeutic intervention and also some 
members can serve as diagnostic markers for tumor development, growth and/or 
25 progression. As discussed, the members of this family are involved in 

proteolytic processes that are implicated in tumor development, growth and/or 
progression. This implication is based upon their functions as proteolytic 
enzymes in processes related to ECM degradation and/or remodeling and 
activation of pro-growth factors, pro-hormones and/or pro-angiogenic 
30 compounds. In addition, their levels of expression or level of activation or their 
apparent activity resulting from substrate levels or alterations in substrates and 
levels thereof differs in tumor cells and non-tumor cells in the same tissue. 
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Simiiarly the level of co-factors or receptors for these proteases can vary 
between tumor and non-tumor cells. Hence, protocols and treatments that alter 
their activity, such as their proteolytic activities and roles in signal transduction, 
and/or their expression, such as by contacting them with a compound that 
5 modulates their activity and/or expression, could impact tumor devefopment, 
growth and/or progression. Also, in some instances, the level of activation 
and/or expression can be altered in tumors, such as lung carcinoma, colon 
adenocarcinoma and ovarian carcinoma. 

The following discussion provides an overview and some details of the 
10 exemplified MTSP25s. 
MTSP25 

MTSP25 is of interest because it is expressed or is active in tumor cells. 
The MTSP provided herein can serve as a diagnostic marker for particular 
tumors, by virtue of a level of activity and/or expression or function in a subject 
15 {i.e. a mammal, particularly a human) with neoplastic disease, compared to a 
subject or subjects that do not have the neoplastic disease. In addition, 
detection of activity (and/or expression) in a particular tissue can be indicative of 
neoplastic disease. It is shown herein, that MTSP25s provided herein 

are expressed and/or activated in certain tumors; hence their activation or 
20 expression can serve as a diagnostic marker for tumor development, growth 

and/or progression. In other instances, the MTSP polypeptide can exhibit altered 
activity by virtue of a change in activity or expression of a co-factor, a substrate 
or a receptor. In addition, in some instances, these MTSPs and/or variants 
thereof can be shed from cell surfaces. Detection of the shed MTSPs, 
25 particularly the extracellular protease domains, in body fluids, such as serum, 
blood, saliva, cerebral spinal fluid, synovial fluid and interstitial fluids, urine, 
sweat and other such fluids and secretions, can serve as a diagnostic tumor 
marker. In particular, detection of higher levels of such shed polypeptides in a 
subject compared to a subject known not to have any neoplastic disease or 
30 compared to earlier samples from the same subject, can be indicative of 
neoplastic disease in the subject. 
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Polypeptides and muteins 

Provided herein are isolated substantially pure single chain and two chain 
polypeptides that contain the protease domain of an MTSP25. The polypeptides 
also can include other non-MTSP sequences of amino acids, but includes the 
5 protease domain or a sufficient portion thereof to exhibit catalytic activity in any 
in vitro assay that assess such protease activity, such as any provided herein. 

MTSP25 polypeptides provided herein are expressed or activated by or in 
tumor cells, typically at a level that differs from the level in which they are 
expressed by or activated in a non-tumor eel! of the same type. Hence, for 
10 example, if the MTSP is expressed in an cervical tumor cell, it is expressed or 
active at a different level from the level in non-tumor cervical cells. MTSP25 
expression or activation can be indicative of cervical, lung, esophogeal, colon, 
kidney, prostate, uterine, pancreatic, breast and other tumors. 

Isolated, substantially pure proteases that include protease domains or a 
1 5 catalytically active portion thereof are provided. Provided are single chain forms 
and two chain forms of the MTSP25. The protease domains can be included in a 
longer protein, and such longer protein is optionally the MTSP25 zymogen. 
Exemplary MTSP25-encoding nucleic acid and protein sequences of a protease 
domain are set forth in SEQ ID Nos. 5 and 6. .Full-length MTSP25-encoding 
20 nucleic acid molecules that contain the sequence set forth in SEQ ID No. 5 or 
SEQ ID No. 15 and polypeptides that include the sequence of amino acids set 
forth in SEQ ID No. 6 or SEQ ID No. 16 or catalytically active portions thereof 
are also provided herein. In particular, provided are two chain forms that include 
a shorter "A" chain linked to a longer "B" chain. The two chains forms include 
25 those in which the "A" chain contains at least amino acids 64-77 and can 
include up to amino acids 17-77, and the "B" chain contains acids 78-348, 
typically 78-323 (or truncated variants (78-313, 314, 31 5, 316... 322, or 
shorter variants that are catalytically active, following cleavage) and catalytically 
active fragments thereof. The single chain form of the MTSP25, which is 
30 proteolyticaily active, contains amino acids 78-348, typically 78-323, and 

catalytically active fragments thereof. The residues are made with reference to 
SEQ ID No. 16. Smaller portions thereof that retain protease activity are 
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contemplated. The polypeptides, which are Type I transmembrane serine 
proteases, contains a signal sequence at the N terminus, a transmembrane 
domains (TM) at the C terminus, and a serine protease domain. The N-terminal 
signal peptide can be cleaved. The C-terminal transmembrane domain spans 
5 residues 324-346 in the exemplified MTSP25. It is understood that variants of 
the MTSP25 as defined herein, including those in which amino acid 1 27 is Thr, 
are contemplated. 

Substantially purified MTSP25 protease is encoded by a nucleic acid that 
hybridizes to a nucleic acid molecule containing the protease domain encoded by 
10 the sequence of nucleotides that encodes the residues forth in SEQ. ID No. 6 {or 
catalytically active fragments thereof) under at least moderate, generally high, 
stringency conditions, such that the protease domain encoding nucleic acid 
thereof hybridizes along its full-length or at least 70%, 80% or 90% of the full- 
length thereof. 

1 5 Also included are substantially purified MTSP25 zymogens, activated two 

chain forms, single chain protease domains and two chain protease domains as 
described above. These polypeptides are encoded by nucleic acids that include 
sequences encoding a protease domain that exhibits proteolytic activity and that 
is encoded by nucleic acid molecules that hybridizes to a nucleic acid molecule 
20 having a nucleotide sequence set forth in SEQ ID No. 5 or SEQ ID No. 16 (or 

degenerate variant sequences thereof), typically under moderate, generally under 
high stringency, conditions and generally along the full-length or along at least 
about 70%, 80% or 90% of the full-length (or substantially the full-length) of the 
protease domain. Splice variants are also contemplated herein. 
25 Structural features 

MTSP25 includes several domains in addition to a catalytic domain 
residues (e.g., residues 78-314, SEQ ID No, 16; corresponding to residues 1-237 
of SEQ ID No. 6) and a transmembrane domain. The sequence set forth in SEQ 
ID No. 16 includes a signal peptide sequence (residues 1-16 SEQ ID No. 16). A 
30 C-terminal transmembrane domain (aa 324-aa 346 SEQ ID No. 16), occurs at the 
carboxy terminus of MTSP25. 
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MTSP25 contains a trypsin-like serine protease domain (residues 78-323 
(or truncated variants (78-313, 314, 315, 316 . . . 322, or shorter variants that 
are catalytically active, following cleavage) following cleavage) and has a 
protease activation cleavage site (...R 77 II 78 IGG..., where I indicates a protease 
5 activation/cleavage site) at the amino terminus of the domain and catalytic triad 
residues (H 122 , D 171 and S 26a ) in three highly-conserved regions. In addition, two 
other domains, a signal peptide sequence (aa 1-aa 16 SEQ ID No. 16) and a C- 
terminal transmembrane domain (aa 324-aa 346 SEQ ID No. 17), occur at the 
amino and carboxy termini, respectively, of MTSP25. There are 2 potential N- 
10 glycosylation sites: N 219 AT and N 249 TS. Cys 191 , which is in the protease domain, 
binds to Cys 64 outside the domain, and is unpaired in the single chain form of the 
protease domain, and, as described above, paired in two chain forms. Within 
the serine protease domain, the following cysteine pairings occur: C 107 -C 123 , C 206 - 
C 274' C 237 -C 253 , and C 264 -C 294 . One cysteine (C 340 ) within the C-terminal 
15 transmembrane domain is apparently unpaired. Alignment (e.g., blastp; 

http://www.ncbi.nlm.nih.gov/BLAST) of MTSP25 protein sequence with that of 
enterokinase (GenBank accession number U09860) indicates that the two 
proteins share 41 % sequence identity in their protease domains. 

As noted the protease also can be provided as a two chain molecule. 
20 Single chain and two chain forms can be proteolytically active. A two chain 
form of the polypeptide set forth in SEQ ID No. 16 {this form includes residues 
64-313, generally at least residues 17-313 (or residues 78-323, or truncated 
variants (78-313, 31 4, 31 5, 31 6 .. . 322, or shorter variants that are 
catalytically active, following cleavage), and catalytically active fragments 
25 thereof) is provided; smaller catalytically active two chain forms are also 
provided. As described above, a two chain form is produced by bonding, 
typically between the C 191 and a Cys outside the protease domain, such as 
Cys 64 . Upon activation cleavage the bond remains resulting in a two chain 
polypeptide. The size of the both chains is a function of the starting length of 
30 the polypeptide prior to activation cleavage between the R 77 and l 7a . Any length 
polypeptide that includes the protease domain (residues 78-323 (or truncated 
variants (78-313, 314, 315, 316 . . . 322, or shorter variants that are 
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catalytically active, following cleavage) of SEQ ID No. 16) or catalytically active 
fragments thereof, is contemplated herein. Two chain forms include at least the 
protease domain of a polypeptide from C 64 up to and including C 19v 

Protease domains 

5 MTSP protease domains include the single chain protease domains of 

MTSP25. Provided are the protease domains or proteins that include a portion 
of an MTSP that is the protease domain of any MTSP, particularly a MTSP25. 
The protein can also include other non-MTSP sequences of amino acids, but 
includes the protease domain or a sufficient portion thereof to exhibit catalytic 
10 and/or binding activity in any in vitro assay that assesses such protease or 
binding activity, such as any provided herein. Also provided are two chain 
activated forms of the full length protease and also, two chain forms of the 
protease domain. Thus, isolated, substantially pure proteases that include the 
protease domains or catalytically active portions thereof as single chain forms of 
1 5 SPs are provided. The protease domains can be included in a longer protein, and 
such longer protein is optionally the activated MTSP25 protein, up to and 
including a full-length, or an MTSP25 zymogen. 

In particular, exemplary protease domains include at least a sufficient 
portion of sequences of amino acids set forth of SEQ ID No. 6 to retain catalytic 
20 activity in vitro. As noted, the protease domains of an MTSP are single-chain 
polypeptides or two-chain polypeptides, with an N-terminus (such as IV, VV, IL 
and II) generated at the cleavage site (generally having the consensus sequence 
FUVVGG, FUIVGG, FUIVQ, FUIVNG, FUILGG, FUVGLL, FUILGG or a variation 
thereof; an N-terminus FUV or FUI, where the arrow represents the cleavage 
25 point) when the zymogen is activated. The protease domain of an exemplary 

MTSP25, produced is produced by activation cleavage between R 77 and the l 78 of 
SEQ ID No. 16 (Rt I) includes the sequence FUIIGGT, as set forth in SEQ ID No. 
16. A single chain form includes residues of SEQ ID No. 6 or catalytically active 
fragments thereof. Hence any length single chain polypeptide that includes the 
30 protease domain (residues 78-323 (or truncated variants (78-313, 314, 315, 

316 . . . 322, or shorter variants that are catalytically active, following cleavage) 
SEQ ID No. 16) or catalytically active fragments thereof, is contemplated herein. 
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Two chain forms also are provided and include at least a polypeptide from C 64 up 
to and including C 19l of SEQ ID No. 1 6 or corresponding residues of an MTSP25 
that has at least 60%, 70%, 80%, 90%, or 95% sequence identity there with 
and/or is: 

5 a) a single-chain polypeptide that includes the sequence of amino 

acids set forth as amino acids 1-236 or 1-237 in SEQ ID No. 6 or a catalytically 
active fragment or portion thereof; 

b) a two-chain polypeptide that includes the sequence of amino 
acids set forth in SEQ ID No. 1 6 or a catalytically active fragment or portion 

10 thereof; 

c) a single-chain polypeptide encoded by a sequence of 
nucleotides that hybridizes along at least 70%, 80%, 90% or 95% of its full- 
length under conditions of low, moderate or high stringency to the sequence of 
nucleotides set forth in SEQ ID No. 5 as nucleotides 1-711 (or to degenerate 

1 5 variants of the sequence), or a catalytically active fragment or portion of the 
polypeptide; 

d) a two-chain polypeptide encoded by a sequence of nucleotides 
• that hybridizes along at least. 70%, 80%, 90% or 95% of its full-length under 

conditions of low, moderate or high stringency to the sequence of nucleotides 
20 set forth in SEQ ID No. 5 as nucleotides 1-71 1 (or to degenerate variants of the 
■sequence), or a catalytically active fragment or portion of the polypeptide; 

e) a polypeptide that includes a sequence of amino acids having at 
least about 60%, 70%, 75%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 
88%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% 

25 sequence identity with the sequence of amino acids set forth as amino acids 1- 
237 in SEQ ID No. 6 or SEQ ID No. 16; and/or 

f) a protease domain of a polypeptide encoded by a splice variant 
of a sequence of nucleotides that encodes an MTSP25 of any of a)-e) or a splice 
variant of the sequence of nucleoties MTSP25 set forth in SEQ ID No. 15. 

30 A,so provided are polypeptides that are encoded by nucleic acid 

molecules that meet criteria specified below as (a)-(f). 



i 
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Thus, among the polypeptides provided herein, are: 

a) single and two-chain polypeptides containing, as the only MTSP25 
portion thereof, amino acids 17-323 (or truncated variants (78-313, 314, 315, 
316 .. . 322, or shorter variants that are catalytically active, following cleavage) 

5 or catalytically active portions thereof; " 

b) activated single and two chain forms of polypeptides that contain 
amino acids 17-348, 17-313, 17-314, 17-323, 64-323, 64-191, 64-313, 64- 
314, 78-313, 78-314, 78-323, 78-348, 324-346 or 324-348 of SEQ ID No. 16 
or catalytically active portions thereof; 

10 c) activated single and two chain polypeptides that have a sequence of 

amino acids having at least about 60%, 70%, 75%, 80%, 81 %, 82%, 83%, 
84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 
97%, 98% or 99% with polypeptides of a) or b); and 

d) polypeptides encoded by sequences of nucleic acids that hybridize 

1 5 along at least 70%, 80%, 90% or 95% of the full-length under conditions of 
low, moderate or high stringency to a sequence of nucleotides encoding 
polyeptides a), b) or c). 

Muteins and derivatives 
Full-length MTSP25, zymogen and activated forms thereof and MTSP25 

20 protease domains, portions thereof, and muteins and derivatives of such 

polypeptides are provided. The domains, fragments, derivatives or analogs of an 
MTSP25 that are functionally active are capable of exhibiting one or more 
functional activities associated with the MTSP25 polypeptide, such as serine 
protease activity, immunogenicity and antigenicity, are provided. 

25 Among the derivatives are those based on animal MTSP25s, including, 

but are not limited to, rodent, such as mouse and rat; fowl, such as chicken; 
ruminants, such as goats, cows, deer, sheep; ovine, such as pigs; and humans. 
For example, MTSP25 derivatives can be made by altering their sequences by 
substitutions, additions or deletions. MTSP25 derivatives include, but are not 

30 limited to, those containing, as a primary amino acid sequence, all or part of the 
amino acid sequence of MTSP25, including altered sequences in which 
functionally equivalent amino acid residues are substituted for residues within 
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the sequence resulting in a silent change. For example, one or more amino acid 
residues within the sequence can be substituted by another amino acid of a 
similar polarity which acts as a functional equivalent, resulting in a silent 
alteration. Substitutes for an amino acid within the sequence can be selected 
5 from other members of the class to which the amino acid belongs. For example, 
the nonpolar (hydrophobic) amino acids include alanine, leucine, isoleucine, 
valine, proline, phenylalanine, tryptophan and methionine. The polar neutral 
amino acids include glycine, serine, threonine, cysteine, tyrosine, asparagine, 
and glutamine. The positively charged (basic) amino acids include arginine, 
10 lysine and histidine. The negatively charged (acidic) amino acids include aspartic 
acid and glutamic acid (see, e.g., Table 1). 

Muteins of the MTSP25 or a domain thereof, such as a protease domain, 
in which up to about 10%, 20%, 30%, 40%, 50%, 60%, 70%, 75%, 80%, 
81 %, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91 %, 92%, 93%, 
1 5 94%, 95%, 96%, 97%, 98% or 99% of the amino acids are replaced with 

another amino acid are provided.. Generally such muteins retain at least about 
1%, 2%, 3,%, 5%, 7%, 8%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 
90%, 95% or more (or in increased activity, i.e., 101, 102, 103, 104, 105, 
1 10% or greater) of the protease activity of the unmutated protein. Those of 
20 skill in the art recognize that a polypeptide that retains at least 1 % of the activity 
of the wild-type protease is sufficiently active for use in screening assays or for 
other applications. 

Included among the polypeptides provided herein are those that contain 
only an MTSP25 protease domain as a single or two chain form or a polypeptide 
25 with amino acid changes such that the specificity and protease activity remains 
substantially unchanged or changed (increased or decreased) by a specified 
percentage, such as 1%, 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 
85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 99.5% with 
the MTSP protease domain exemplified herein. 
30 Also provided is a substantially purified two-chain polypeptide containing 

a sequence of amino acids that has at least 60%, 70%, 75%, 80%, 81 %, 82%, 
83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 
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96%, 97%, 98% or 99% identity to the MTSP25 exemplified herein, particularly 
to two-chain polypeptides that contain amino acids corresponding to amino acids 
17-323 (or truncated variants (78-313, 314, 315, 316 . . . 348, or shorter 
variants that are catalytically active, following cleavage) or 17-346 of SEQ ID 
5 No. 16, and to catalytically active portios thereof, where the percentage identity 
is determined using standard algorithms and gap penalties that maximize the 
percentage identity. The human MTSP25 polypeptide is included, although other 
mammalian MTSP25 polypeptides are contemplated. The precise percentage of 
identity can be specified if needed. 

1 0 Muteins in which one or more of the Cys residues, particularly, a residue 

that is paired in the activated two chain form, but unpaired in the protease 
domain alone is/are replaced with any amino acid, typically, although not 
necessarily, a conservative amino acid residue, such as Ser, are contemplated. 
Muteins of MTSP25, particularly those in which Cys residues, such as the Cys 191 

1 5 in the single chain protease domain, is replaced with another amino acid, such as 
Ser, Gly or Ala, that does not eliminate the activity, are provided. Also 
provided are substantially purified MTSP25 polypeptides and functional domains 
thereof, including catalytically active domains and portions, that have at least 
about 60%, 70%, 75%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 

20 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence 
identity with a protease domain that includes a sequence of amino acids set 
forth in as residues 1-327 in SEQ ID No. 6 or catalytically active fragments 
thereof. 

Muteins of the protein are also provided in which amino acids are 
25 replaced with other amino acids. Among the muteins are those in which the Cys 
residues, is/are replaced typically with a conservative amino acid residues, such 
as a serine. Such muteins are also provided herein. Muteins in which 10%, 
20%, 30%, 35%, 40%, 45%, 50% or more of the amino acids are replaced but 
the resulting polypeptide retains at least about 1 %, 2%, 3,%, 5%, 7%, 8%, 
30 10%, 20%, 30%, 35%, 40%, 45%, 50%, 60%, 70%, 80%, 90% or 95% of 

the catalytic activity as the unmodified form for the same substrate are provided. 
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Muteins can be made by making conservative amino acid substitutions 
and also non-conservative amino acid substitutions. For example, amino acid 
substitutions that desirably alter properties of the proteins can be made. In one 
embodiment, mutations that prevent degradation of the polypeptide can be 
5 made. Many proteases cleave after basic residues, such as R and K; to eliminate 
such cleavage, the basic residue is replaced with a non-basic residue. Also, non- 
conservative changes at amino acids outside of the protease domain can be 
effected without altering protease activity. Non-conservative changes at amino 
acids that are responsible for activities other than protease activity may be 

10 desirable. For example, interaction of the protease with an inhibitor can be 

blocked while retaining catalytic activity by effecting a non-conservative change 
at the site interaction of the inhibitor with the protease. Similarly, receptor 
binding can be altered without altering catalytic activity by effecting a non- 
conservative or conservative change at a site of interaction of the receptor with 

1 5 the protease. 

Antigenic epitopes that contain at least 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 
14, 15, 20, 25, 30, 40, 50, and typically 10-15 amino acids of the MTSP25 
polypeptide are provided. These antigenic epitopes are used, for example, to 
raise antibodies. Antibodies specific for each epitope or combinations thereof 

20 and for single and two-chain forms are also provided. In particular, the 

antibodies typically are selected to specifically bind to the activated single-chain 

or activated two-chain forms of the protease domain or to a full-length activated 

two-chain form, but not to the full-length zymogen form of an MTSP25. 

Nucleic acid molecules, vectors and plasmids, cells and expression of 
25 MTSP25 polypeptides 

Nucleic acid molecules 

Due to the degeneracy of nucleotide coding sequences, other nucleic 
sequences which encode substantially the same amino acid sequence as a MTSP 
are contemplated. These include but are not limited to nucleic acid molecules 
30 that include all or portions of MTSP25-encoding genes that are altered by the 
substitution of different codons that encode the amino acid residue within the 
sequence, thus producing a silent change. 
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Mucleic acids 

Also provided herein are nucleic acid molecules that encode MTSP25 
polypeptides and the encoded proteins. In particular, nucleic acid molecules 
encoding MTSP25 from animals, including splice variants thereof are provided. 
5 The encoded proteins are also provided. Also provided are functional domains 
thereof. For each of the nucleic acid molecules provided, the nucleic acid can be 
DNA or RIMA or PNA or other nucleic acid analogs or can include non-natural 
nucleotide bases. Also provided are isolated nucleic acid molecules that include 
a sequence of nucleotides complementary to the nucleotide sequence encoding 
10 an MTSP. 

Also provided are nucleic acid molecules that encode single chain or two 
chain MTSP proteases that have proteolytic activity in an in vitro proteolysis 
assay and that have at least 60%, 70%, 75%, 80%, 81 %, 82%, 83%, 84%, 
85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 
15 98% or 99% sequence identity with the full-length of a protease domain of an 
MTSP25 polypeptide, or that hybridize along their full-length or along at least 

■ 

about 70%, 80% or 90% of the full-length nucleic acid to a nucleic acids that 
encode a protease domain, particularly under conditions of moderate, generally 
high, stringency. As above, the encoded polypeptides contain the protease as a 

20 single chain; activated forms thereof can be produced and are provided. 

In- one embodiment, a nucleic acid molecule that encodes an MTSP, 
designated MTSP25 is provided. The nucleic acid molecule includes the open 
reading frame in the sequence of nucleotides set forth in SEQ ID No. 5 or SEQ ID 
No. 15. Also provided are nucleic acid molecules that hybridize under conditions 

25 of at least low stringency, moderate stringency, and generally high stringency to 
the following sequence of nucleic acids {SEQ ID No. 5) particularly to the open 
reading frame encompassed by nucleotides that encode a single protease domain 
thereof, or any domain of MTSP25. 

In certain embodiments, the isolated nucleic acid fragment hybridizes to 

30 the nucleic acid having the nucleotide sequence set forth in SEQ ID No. 5 under 
high stringency conditions, and generally contains the sequence of nucleotides 
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set forth as nucleotides 1-690 in SEQ ID No. 5. The protein contains a 
transmembrane domain 0~M) and a serine protease domain. 

Also provided, are muteins of the nucleic acid molecules that encode 
polypeptides in which amino acids are replaced with other amino acids. Among 
5 the muteins are those in which the Cys residue-encoding codons, is/are replaced 
with other amino acid residues, such as a codon encoding a serine. Such 
muteins are also provided herein. Each of such domains is provided herein as 
are nucleic acid molecules that include sequences of nucleotides encoding such 
domains. Some MTSPs can additionally include a CUB domain, LDLR domain, a 

10 scavenger-receptor cysteine rich (SRCR) domain and other domains. 

The isolated nucleic acid fragment is DNA, including genomic or cDNA, or 
is RNA, or can include other components, such as peptide nucleic acid (PNA) 
and other nucleotide analogs. The isolated nucleic acid can include additional 
components, such as heterologous or native promoters, and other transcriptional 

1 5 and translational regulatory sequences, and these genes can be linked to other 
genes, such as reporter genes or other indicator genes or genes that encode 
indicators. 

Also provided are nucleic acid molecules that hybridize to the above- 
noted sequences of nucleotides encoding MTSP25 at least at low stringency, 

20 moderate stringency, and typically at high stringency, and that encode the 
protease domain and/or the full-length protein or at least 60%, 70%, 80% or 
90% of the full-length protease domain or other domains of an MTSP25. or a 
splice variant or allelic variant thereof. Generally the molecules hybridize under 
such conditions along their full-length or along at least 60%, 70%, 80% or 90% 

25 of the full-length for at least one domain and encode at least one domain, such 
as the protease or extracellular domain, of the polypeptide. In particular, such 
nucleic acid molecules include any isolated nucleic acid fragment that encodes at 
least one domain of a membrane serine protease, that (1) contains a sequence of 
nucleotides that encodes the protease or a domain thereof, and (2) is selected 

30 from among, for example: 
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(a) a sequence of nucleotides that encodes the protease domain of an 
MTSP25 that includes a sequence of nucleotides that encodes 
amino acids 1-237 in SEQ ID No. 6 or amino acids 17-313, 17- 
314, 64-191, 64-313, 64-314, 78-313 or 314 (or residues 78- 
5 323, or truncated variants (78-313, 314, 315, 316 . . . 322, or 

shorter variants that are catalytically active, following cleavage)) in 
SEQ ID No. 16 or catalytically active portions thereof, and 
particularly to sequences of nucleotides that do not encode a full- 
length MTSP25; 

10 (b) a sequence of nucleotides that encodes such portion and 

hybridizes^ under conditions of high stringency, generally to nucleic 
acid that is complementary to a mRNA transcript present in a 
mammalian cell that encodes such protein or fragment thereof; 

(c) a sequence of nucleotides that encodes the protease or a domain 

1 5 thereof that includes a sequence of nucleotides set having at least 

about 60%, 70%, 80%, 90% or 95% sequence identity with the 
sequence set forth as nucleotides encoding amino acids 1 7-346 in 
SEQ ID No. 16; 

(d) a sequence of nucleotides that encodes the protease or a domain 
20 thereof that includes a sequence of nucleotides set having at least 

about 60%, 70%, 80%, 90% or 95% sequence identity with the 
sequence set forth in as nucleotides 1-711 in SEQ ID No. 5 or with 
a sequence of nucleotides in any of a)-c); 
(f) a sequence of nucleotides that encodes a transmembrane protease 
25 of a)-d) that includes a sequence of amino acids encoded by a 

sequence of nucleotides that encodes such subunit and hybridizes 
under conditions of low, moderate or high stringency to DNA that 
is complementary to the mRNA transcript. 
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The isolated nucleic acids can contain least 10 nucleotides, 25 
nucleotides, 50 nucleotides, 100 nucleotides, 150 nucleotides, or 200 
nucleotides or more contiguous nucleotides of an MTSP25-encoding sequence, 
or a full-length SP coding sequence. In another embodiment, the nucleic acids 
5 are smaller than 35, 200 or 500 nucleotides in length. Nucleic acids that 

hybridize to or are complementary to an MTSP25-encoding nucleic acid molecule 
can be single or double-stranded. For example, nucleic acids are provided that 

include a sequence complementary to (specifically are the inverse complement 

» 

of) at least 10, 25, 50, 100, or 200 nucleotides or the entire coding region of an 
10 MTSP25 encoding nucleic acid, particularly the protease domain thereof. For 
MTSP25 the full-length protein or a domain or active fragment thereof is also 
provided. 

Probes, primers, antisense oligonucleotides and dsRNA 
Also provided are fragments thereof that can be used as probes or 

15 primers and that contain at least about 10 nucleotides, 14 nucleotides, generally 
at least about 16 nucleotides, often at least about 30 nucleotides. The length of 
the probe or primer is a function of the size of the genome probed; the larger the 
genome, the longer the probe or primer required for specific hybridization to a 
single site. Those of skill in the art can select appropriately sized probes and 

20 primers. Generally probes and primers as described are single-stranded. Double 
stranded probes and primers can be used, if they are denatured when used. 

Probes and primers derived from the nucleic acid molecules are provided. 
Such probes and primers contain at least 8, 14, 16, 30, 100 or more contiguous 
nucleotides with identity to contiguous nucleotides of an MTSP25, and probes of 

25 at least 8, 14, 16, 30, 50 or 100 contiguous sequence of nucleotides of SEQ ID 
No. 5 or SEQ ID No. 15. The probes and primers are optionally labelled with a 
detectable label, such as a radiolabel or a fluorescent tag, or can be mass 
differentiated for detection by mass spectrometry or other means. 
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Also provided is an isolated nucleic acid molecule that includes the 
sequence of molecules that is complementary to the nucleotide sequence 
encoding MTSP25 or the portion thereof. Antisense, and double-stranded RNA 
(dsRNA), such as RNAi is also provided. 
5 Plasmids, Vectors and Cells 

Plasmids and vectors containing the nucleic acid molecules are also 
provided. Cells containing the vectors, including cells that express the encoded 
proteins are provided. The cell can be a bacterial cell, a yeast cell, a fungal cell, 
a plant cell, an insect eel! or an animal cell. Methods for producing an MTSP or 
10 single chain form of the protease domain thereof by, for example, growing the 
cell under conditions whereby the encoded MTSP is expressed by the cell, and 
recovering the expressed protein, are provided herein. As noted, for MTSP25, 
the activated (two chain) protease and single chain and two chain protease 
domains are provided. As described herein, the cells are used for expression of 
15 the protein, which can be secreted or expressed in the cytoplasm. 

As discussed below, the MTSP25 polypeptide, and catalytically active 
portions thereof, can be expressed on the surface of a cell. In addition, all or 
portions thereof can be expressed as a secreted protein using the native signal 
sequence or a heterologous signal. Alternatively, all or portions of the 
20 polypeptide can be expressed as inclusion bodies in the cytoplasm and isolated 
therefrom. The resulting protein can be treated to refold if necessary. 
C. Tumor specificity and tissue expression profiles 

MTSPs are of interest because they appear to be expressed and/or 
activated at different levels in tumor cells from normal cells, or have functional 
25 activity that is different in tumor cells from normal cells, such as by an alteration 
in a substrate for the MTSP, or a cofactor or receptor of the MTSP. MTSP25 is 
of interest because it is expressed or is active in tumor cells. Hence the MTSPs 
provided herein can serve as diagnostic markers for certain tumors. 

Each MTSP has a characteristic tissue expression profile; the MTSPs in 
30 particular, although not exclusively expressed or activated in tumors, exhibit 

characteristic tumor tissue expression or activation profiles. In some instances, 
MTSPs can have different activity in a tumor cell from a non-tumor cell by virtue 



WO 03/031585 



PCT/US02/32417 



-64- 

of a change in a substrate or cofactor or receptor therefor or other factor that 
would alter the functional activity of the MTSP. Hence each can serve as a 
diagnostic marker for particular tumors, by virtue of a level of activity and/or 
expression or function in a subject (i.e. a mammal, particularly a human) with 
5 neoplastic disease, compared to a subject or subjects that do not have the 
neoplastic disease, in addition, detection of activity (and/or expression) in a 
particular tissue can be indicative of neoplastic disease. Shed MTSPs in body 
fluids can be indicative of neoplastic disease. Also, by virtue of the activity 
and/or expression profiles of each, they can serve as therapeutic targets, such 
10 as by administration of modulators of the activity thereof, or, as by 

administration of a prodrug specifically activated by one of the MTSPs. 

Tissue expression profiles 
MTSP25 

MTSP25 transcript was expressed weakly in the lymph node. In the 
1 5 cancer profiling array analysis, MTSP25 is highly expressed in prostate samples 
(in normal and cancer samples). MTSP25 was highly expressed in a kidney 
ytumor sample, but not in its normal tissue counterpart. MTSP25 was also 
expressed in breast cancer samples, but not in its normal tissue counterpart. 
MTSP25 was expressed in normal uterus samples, but not in their tumor 
20 counterparts. MTSP25 expression was also observed in ovarian cancer samples. 
Among these three samples, the expression of MTSP25 was also detected in 
one of the matched normal tissue counterparts. MTSP25 expression was also 
detected in tumor samples in colon cDNA pairs. 

PCR analysis revealed that MTSP25 cDNA was strongly detected in testis 
25 and mammary gland adenocarcinoma, weakly detected in brain, placenta, lung, 
spleen, prostate, small intestine, colon, and leukocyte, and very weakly detected 
in heart, liver, and pancreas. 

D. Identification and isolation of MTSP25 polypeptide genes 

The MTSP polypeptides and/or domains thereof, can be obtained by 
30 methods well known in the art for protein purification and recombinant protein 
expression. Any method known to those of skill in the art for identification of 
nucleic acids that encode desired genes can be used. Any method available in 
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the art can be used to obtain a full-length [i.e., encompassing the entire coding 
region) cDNA or genomic DNA clone encoding an MTSP polypeptide. For 
example, the polymerase chain reaction (PCR) can be used to amplify a 
sequence that is expressed in normal and tumor cells or tissues, e.g., nucleic 
5 acids encoding an MTSP25 polypeptide (SEQ. Nos: 5 and 17), in a genomic or 
cDNA library. Oligonucleotide primers that hybridize to sequences at the 3' and 
5' termini of the identified sequences can be used as primers to amplify by PCR 
sequences from a nucleic acid sample (RNA or DNA), generally a cDNA library, 
from an appropriate source (e.g., tumor or cancer tissue). 
10 PCR can be carried out, e.g., by use of a Perkin-Elmer Cetus thermal 

cycler and Taq polymerase (Gene Amp'"). The DNA being amplified can include 
mRNA or cDNA or genomic DNA from any eukaryotic species. One can choose 
to synthesize several different degenerate primers, for use in the PCR reactions. 
It is also possible to vary the stringency of hybridization conditions used in 
15 priming the PCR reactions, to amplify nucleic acid homologs {e.g., to obtain 
MTSP polypeptide sequences from species other than humans or to obtain 
human sequences with homology to MTSP25 polypeptide) by allowing for 
greater or lesser degrees of nucleotide sequence similarity between the known 
nucleotide sequence and the nucleic acid homolog being isolated. For cross- 
20 species hybridization, low stringency to moderate stringency conditions are • 
used. For same species hybridization, moderately stringent to highly stringent 
conditions are used. The conditions can be empirically determined. 

After successful amplification of the nucleic acid encoding all or a portion 
of the identified MTSP polypeptide sequence or of a nucleic acid encoding all or 
25 a portion of an MTSP polypeptide homolog, that segment can be molecularly 
cloned and sequenced, and used to design primers or as a probe to isolate a 
complete cDNA or genomic clone. This, in turn, permits the determination of the 
gene's complete nucleotide sequence, the analysis of its expression, and, the 
production of its protein product for functional analysis. Once the nucleotide 
30 sequence is determined, an open reading frame encoding the MTSP polypeptide 
gene protein product can be determined by any method well known in the art for 
determining open reading frames, for example, using publicly available computer 
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programs for nucleotide sequence analysis. Once an open reading frame is 
defined, it is routine to determine the amino acid sequence of the protein 
encoded by the open reading frame. In this way, the nucleotide sequences of 
the entire MTSP polypeptide genes as well as the amino acid sequences of 
5 MTSP polypeptide proteins and analogs can be identified. 

Any eukaryotic cell potentially can serve as the nucleic acid source for 
the molecular cloning of the MTSP polypeptide gene. The nucleic acids can be 
isolated from vertebrate, mammalian, human, porcine, bovine, feline, avian, 
equine, canine, as well as additional primate sources, insects, plants and other 

10 organisms. The DNA can be obtained by standard procedures known in the art 
from cloned DNA {e.g., a DNA "library"), by chemical synthesis, by cDNA 
cloning, or by the cloning of genomic DNA, or fragments thereof, purified from 
the desired cell (see, e.g., Sambrook eta/. (2001) Molecular Cloning, A 
Laboratory Manual, 3d Ed., Cold Spring Harbor Laboratory Press, Cold Spring 

15 Harbor, New York; Glover, D.M. (ed.), 1985, DNA Cloning:, A Practical 
Approach, MRL Press, Ltd., Oxford, U.K. Vol. I, II). Clones derived from 
genomic DNA can contain regulatory and intron DNA regions in addition to 
coding regions; clones derived from cDNA will contain only exon sequences. For 
any source, the gene is cloned into a suitable vector for propagation thereof. 

20 In the molecular cloning of the gene from genomic DNA, DNA fragments 

are generated, some of which will encode the desired gene. 
The DNA can be cleaved at specific sites using various restriction enzymes. 
Alternatively, one can use DNAse in the presence of manganese to fragment the 
DNA, or the DNA can be physically sheared, for example, by sonication. The 

25 linear DNA fragments then can be separated according to size by standard 
techniques, including but not limited to, agarose and polyacrylarnide gel 
electrophoresis and column chromatography. 

Once the DNA fragments are generated, identification of the specific DNA 
fragment containing the desired gene can be accomplished in a number of ways. 

30 For example, a portion of the MTSP polypeptide (of any species) gene (e.g., a 
PCR amplification product obtained as described above or an oligonucleotide 
having a sequence of a portion of the known nucleotide sequence) or its specific 
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RNA, or a fragment thereof can be purified and labeled, and the generated DNA 
fragments can be screened by nucleic acid hybridization to the labeled probe 
(Benton and Davis, Science 196 :180 (1977); Grunstein and Hogness, Proc. NatL 
Acad. Sci. U.S.A. 72:3961 (1975)). Those DNA fragments with substantial 
5 homology to the probe will hybridize. It is also possible to identify the 

appropriate fragment by restriction enzyme digestion(s) and comparison of 
fragment sizes with those expected according to a known restriction map if such 
is available or by DNA sequence analysis and comparison to the known 
nucleotide sequence of MTSP polypeptide. Further selection can be carried out 

10 on the basis of the properties of the gene. Alternatively, the presence of the 
gene can be detected by assays based on the physical, chemical, or 
immunological properties of its expressed product. For example, cDNA clones, 
or DNA clones which hybrid-select the proper mRNA, can be selected which 
produce a protein that, e.g., has similar or identical electrophoretic migration, 

15 isoelectric focusing behavior, proteolytic digestion maps, antigenic properties, 
serine protease activity. If an anti-MTSP polypeptide antibody is available, the 
protein can be identified by binding of labeled antibody to the MTSP polypeptide 
synthesizing clones, in an ELISA (enzyme-linked immunosorbent assay)-type 
procedure. 

20 Alternatives to isolating the MTSP25 polypeptide genomic DNA include, 

but are not limited to, chemically synthesizing the gene sequence from a known 
sequence or making cDNA to the mRNA that encodes the MTSP polypeptide. 
For example, RNA for cDNA cloning of the MTSP polypeptide gene can be 
isolated from cells expressing the protein. The identified and isolated nucleic 

25 acids then can be inserted into an appropriate cloning vector. A large number of 
vector-host systems known in the art can be used. Possible vectors include, but 
are not limited to, plasmids or modified viruses, but the vector system must be 
compatible with the host cell used. Such vectors include, but are not limited to, 
bacteriophages such as lambda derivatives, or plasmids such as pBR322 or pUC 

30 plasmid derivatives or the Bluescript vector (Stratagene, La Jolla, CA). The 

insertion into a cloning vector can, for example, be accomplished by ligating the 
DNA fragment into a cloning vector which has complementary cohesive termini. 
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lf the complementary restriction sites used to fragment the DNA are not present 
in the cloning vector, the ends of the DNA molecules can be enzymatically 
modified. Alternatively, any site desired can be produced by ligating nucleotide 
sequences (linkers) onto the DNA termini; these ligated linkers can include 
5 specific chemically synthesized oligonucleotides encoding restriction 

endonuclease recognition sequences. In an alternative method, the cleaved 
vector and MTSP polypeptide gene can be modified by homopolymeric tailing. 
Recombinant molecules can be introduced into host cells via transformation, 
transfection, infection, electroporation, calcium precipitation and other methods, 

10 so that many copies of the gene sequence are generated. 

In specific embodiments, transformation of host cells with recombinant 
DNA molecules that incorporate the isolated MTSP polypeptide gene, cDNA, or 
synthesized DNA sequence enables generation of multiple copies of the gene. 
Thus, the gene can be obtained in large quantities by growing transformants, 

1 5 isolating the recombinant DNA molecules from the transformants and, when 

necessary, retrieving the inserted gene from the isolated recombinant DNA. 

E. Vectors, plasmids and cells that contain nucleic acids encoding an MTSP 
polypeptide or protease domain thereof and expression of MTSP 
polypeptides 

20 Vectors and cells 

For recombinant expression of one or more of the MTSP polypeptides, the 
nucleic acid containing ail or a portion of the nucleotide sequence encoding the 
MTSP polypeptide can be inserted into an appropriate expression vector, i.e., a 
vector that contains the necessary elements for the transcription and translation 
25 of the inserted protein coding sequence. The necessary transcriptional and 
translational signals can also be supplied by the native promoter for MTSP 
genes, and/or their flanking regions. 

Also provided are vectors that contain nucleic acid encoding the MTSPs. 
Cells containing the vectors are also provided. The cells include eukaryotic and 
30 prokaryotic cells, and the vectors are any suitable for use therein. 

Prokaryotic and eukaryotic cells, including endothelial cells, containing the 
vectors are provided. Such cells include bacterial cells, yeast cells, fungal cells, 
plant cells, insect cells and animal cells. The cells are used to produce an MTSP 



WO 03/031585 



PCT/US02/32417 



-69- 

polypeptide or protease domain thereof by (a) growing the above-described cells 
under conditions whereby the encoded MTSP polypeptide or protease domain of 
the MTSP polypeptide is expressed by the cell, and then (b) recovering the 
expressed protease domain protein. In the exemplified embodiments, the 
5 protease domain is secreted into the medium. 

In one embodiment, vectors that include a sequence of nucleotides that 
encode a polypeptide that has protease activity and contains all or a portion of 
only the protease domain, or multiple copies thereof, of an SP protein are 
provided. Also provided are vectors that include a sequence of nucleotides that 
10 encodes the protease domain and additional portions of an SP protein up to and 
including a full length SP protein, as well as multiple copies thereof. The vectors 
can be selected for expression of the SP protein or protease domain thereof in 
the cell or such that the SP protein is expressed as a secreted protein. 
Alternatively, the vectors can include signals necessary for secretion of encoded 
15 proteins. When the protease domain is expressed the nucleic acid is linked to 

nucleic acid encoding a secretion signal, such as the Saccharomyces cerevisiae a 
mating factor signal sequence or a portion thereof, or the native signal sequence. 

A variety of host-vector systems can be used to express the protein 
coding sequence. These include but are not limited to mammalian cell systems 
20 infected with eukaryotic virus vectors (e.g. vaccinia virus, adenovirus, 

adenovirus-associated virus, SV40, herpes virus systems), insect cell systems 
infected with suitable expression vectors, such as baculovirus systems; 
microorganisms such as yeast containing yeast vectors; or bacteria transformed 
with bacteriophage, DNA, plasmid DNA, or cosmid DNA. The expression 
25 elements of vectors vary in their strengths and specificities. Depending on the 
host-vector system used, any one of a number of suitable transcription and 
translation elements can be used. 

Any methods known to those of skill in the art for the insertion of nucleic 
acid fragments into a vector can be used to construct expression vectors 
30 containing a chimeric gene containing appropriate transcriptional/translational 
control signals and protein coding sequences. These methods can include in 
vitro recombinant DNA and synthetic techniques and in vivo recombinants 
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(genetic recombination). Expression of nucleic acid sequences encoding MTSP 
polypeptide, or domains, derivatives, fragments or homologs thereof, can be 
regulated by a second nucleic acid sequence so that the genes or fragments 
thereof are expressed in a host transformed with the recombinant DNA 
5 molecule(s). For example, expression of the proteins can be controlled by any 
promoter/enhancer known in the art. In a specific embodiment, the promoter is 
not native to the genes for MTSP polypeptide. Promoters which can be used 
include but are not limited to the SV40 early promoter (Bernoist and Chambon, 
Nature 290:304-310 (1981)), the promoter contained in the 3' long terminal 
10 repeat of Rous sarcoma virus (Yamamoto et al., Cell 22:787-797 (1980)), the 
herpes thymidine kinase promoter (Wagner et al., Proc. Natl. Acad. Sci. USA 
78:1441-1445 (1981)), the regulatory sequences of the metallothionein gene 
(Brinster et al., Na ture 296:39-42 (1982)); prokaryotic expression vectors such 
as the ^-lactamase promoter (Villa-Kamaroff et al., Proc. Natl. Acad. Sci. USA 
15 75:3727-3731 1978)) or the tac promoter (DeBoer et al., Proc. Nat/. Acad. Sci. 
USA 80:21-25 (1983)); see also "Useful Proteins from Recombinant Bacteria": in 
Scientific American 242:79-94 (1980)); plant expression vectors containing the 
nopaline synthetase promoter (Herrar-Estrelia et a!., Nature 303 :209-213 (1984)) 
or the cauliflower mosaic virus 35S RNA promoter (Garder et al., Nucleic Acids 
20 Res. 9:2871 (1981)), and the promoter of the photosynthetic enzyme ribulose 
bisphosphate carboxylase (Herrera-Estrella et al., Nature 310 :1 15-120 (1984)); 
promoter elements from yeast and other fungi such as the Gal4 promoter, the 
alcohol dehydrogenase promoter, the phosphoglycerol kinase promoter, the 
alkaline phosphatase promoter, and the following animal transcriptional control 
25 regions that exhibit tissue specificity and have been used in transgenic animals: 
elastase I gene control region which is active in pancreatic acinar cells (Swift et 
al., Cell 38:639-646 (1984); Ornitz et al., Cold Spring Harbor Symp. Quant. Biol. 
50:399-409 (1986); MacDonald, Hepatology 7:425-51 5 (1987)); insulin gene 
control region which is active in pancreatic beta cells (Hanahan et al.. Nature 
30 315:1 15-122 (1985)), immunoglobulin gene control region which is active in 
lymphoid cells (Grosschedl et al., Cell 38:647-658 (1984); Adams et al., Nature 
318:533-538 (1985); Alexander et al., Mot. Cell Biol. 7:1436-1444 (1987)), 
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mouse mammary tumor virus control region which is active in testicular, breast, 
lymphoid and mast cells (Leder et al.. Cell 45:485-495 (1986)), albumin gene 
control region which is active in liver (Pinckert et al.. Genes and Devel. 1:268- 
276 (1987)), alpha-fetoprotein gene control region which is active in liver 
5 (Krumlauf et al., Mol. Cell. Biol. 5:1 639-1 648 (1 985); Hammer et al., Science 
235:53-58 1987)), alpha-1 antitrypsin gene control region which is active in liver 
(Kelsey et al., Genes and Devel. 1:161-171 (1987)), beta globin gene control 
region which is active in myeloid cells (Mogram et al.. Nature 315 :338-340 
(1985); Kollias et al.. Cell 46:89-94 (1986)), myelin basic protein gene control 
10 region which is active in oligodendrocyte cells of the brain (Readhead et al.. Cell 
48=703-712 (1987)), myosin light chain-2 gene control region which is active in 
skeletal muscle (Sani, Nature 314:283-286 (1 985)), and gonadotropic releasing 
hormone gene control region which is active in gonadotrophs of the 
hypothalamus (Mason et al.. Science 234 :1372-1378 (1986)). 
15 ,n a specific embodiment, a vector is used that contains a promoter 

operably linked to nucleic acids encoding an MTSP polypeptide, or a domain, 
fragment, derivative or homolog, thereof, one or more origins of replication, and 
optionally, one or more selectable markers {e.g., an antibiotic resistance gene). 
Expression vectors containing the coding sequences, or portions thereof, of an 
20 MTSP polypeptide, is made, for example, by subcloning the coding portions into 
the EcoRI restriction site of each of the three pGEX vectors (glutathione S- 
trahsferase expression vectors (Smith and Johnson, Gene 7:31-40 (1988)). This 
allows for the expression of products in the correct reading frame. Exemplary 
vectors and systems for expression of the protease domains of the MTSP 
polypeptides include the well-known Pichia vectors (available, for example, from 
Invitrogen, San Diego, CA), particularly those designed for secretion of the 
encoded proteins. The protein can also be expressed cytoplasmically, such as in 
the inclusion bodies. One exemplary vector is described in the EXAMPLES. 

Plasmids for transformation of E. coli cells, include, for example, the pET 
30 expression vectors (see, U.S patent 4,952,496; available from NOVAGEN, 

Madison, Wl; see, also literature published by Novagen describing the system). 
Such plasmids include pET 1 1 a, which contains the T7lac promoter, T7 
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terminator, the inducible £ coli lac operator, and the lac repressor gene; pET 
12a-c, which contains the T7 promoter, 17 terminator, and the E. coli ompT 
secretion signal; and pET 15b and pET19b {NOVAGEN, Madison, Wl), which 
contain a His-Tag™ leader sequence for use in purification with a His column and 
5 a thrombin cleavage site that permits cleavage following purification over the 
column; the T7-lac promoter region and the T7 terminator. 

The vectors are introduced into host cells, such as Pichia cells and 
bacterial cells, such as E. coli, and the proteins expressed therein. Exemplary 
Pichia strains, include, for example, GS1 15. Exemplary bacterial hosts contain 
10 chromosomal copies of DNA encoding T7 RNA polymerase operably linked to an 
inducible promoter, such as the lacUV promoter (see, U.S. Patent No. 
4,952,496). Such hosts include, but are not limited to, the lysogenic E. coli 
strain BL21 (DE3). 

Expression and production of proteins 

1 5 The MTSP domains, derivatives and analogs can be produced by various 

methods known in the art. For example, once a recombinant cell expressing an 
MTSP polypeptide, or a domain, fragment or derivative thereof, is identified, the 
individual gene product can be isolated and analyzed. This is achieved by 
assays based on the physical and/or functional properties of the protein, 

20 including, but not limited to, radioactive labeling of the product followed by 
analysis by gel electrophoresis, immunoassay, cross-linking to marker-labeled 
product, and assays of proteolytic activity. 

The MTSP polypeptides can be isolated and purified by standard methods 
known in the art (either from natural sources or recombinant host cells 

25 expressing the complexes or proteins), including but not restricted to column 

chromatography (e.g., ion exchange, affinity, gel exclusion, reversed-phase high 
pressure and fast protein liquid), differential centrifugation, differential solubility, 
or by any other standard technique used for the purification of proteins. 
Functional properties can be evaluated using any suitable assay known in the 

30 art. 

Alternatively, once an MTSP polypeptide or its domain or derivative is 
identified, the amino acid sequence of the protein can be deduced from the 
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nucleotide sequence of the gene which encodes it. As a result, the protein or its 
domain or derivative can be synthesized by standard chemical methods known in 
the art (e.g. see Hunkapiller et al r Nature 310 :105-1 1 1 (1984)). 

Manipulations of MTSP polypeptide sequences can be made at the protein 
5 level. Also contemplated herein are MTSP polypeptide proteins, domains 
thereof, derivatives or analogs or fragments thereof, which are differentially 
modified during or after translation, e.g., by glycosylation, acetyiation, 
phosphorylation, amidation, derivatization by known protecting/blocking groups, 
proteolytic cleavage, linkage to an antibody molecule or other cellular ligand. 
10 Any of numerous chemical modifications can be carried out by known 

techniques, including but not limited to specific chemical cleavage by cyanogen 
bromide, trypsin, chymotrypsin, papain, V8 protease, NaBH 4 , acetyiation, 
formylation, oxidation, reduction, metabolic synthesis in the presence of 
tunicamycin and other such agents. 
15 In addition, domains, analogs and derivatives of an MTSP polypeptide can 

be chemically synthesized. For example, a peptide corresponding to a portion of 
an MTSP polypeptide, which includes the desired domain or which mediates the 
desired activity in vitro can be synthesized by use of a peptide synthesizer. 
Furthermore, if desired, nonclassical amino acids or chemical amino acid analogs 
20 can be introduced as a substitution or addition into the MTSP polypeptide 

• sequence. Non-classical amino acids include but are not limited to the D-isomers 
of the common amino acids, a-amino isobutyric acid, 4-aminobutyric acid, Abu, 
2-aminobutyric acid, e-Abu, e-Ahx, 6-amino hexanoic acid, Aib, 2-amino 
isobutyric acid, 3-amino propionoic acid, ornithine, norleucine, norvaline, 
25 hydroxyproline, sarcosine, citrulline, cysteic acid, t-butylglycine, t-butylalanine, 
phenylglycine, cyclohexylalanine, B-alanine, fluoro-amino acids, designer amino 
acids such as B-methyl amino acids, Ca-methyl amino acids, Na-methyl amino 
acids, and amino acid analogs in general. Furthermore, the amino acid can be D 
(dextrorotary) or L (levorotary). 
30 ,n cases where natural products are suspected of being mutant or are 

isolated from new species, the amino acid sequence of the MTSP polypeptide 
isolated from the natural source, as well as those expressed in vitro, or from 
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synthesized expression vectors in vivo or in vitro, can be determined from 
analysis of the DNA sequence, or alternatively, by direct sequencing of the 
isolated protein. Such analysis can be performed by manual sequencing or 
through use of an automated amino acid sequenator. 
5 Modifications 

A variety of modifications of the MTSP polypeptides and domains are 
contemplated herein. An MTSP-encoding nucleic acid molecule can be modified 
by any of numerous strategies known in the art (Sambrook et al. (1990), 
Molecular Cloning, A Laboratory Manual, 2d ed., Cold Spring Harbor Laboratory, 

10 Cold Spring Harbor, New York). The sequences can be cleaved at appropriate 

sites with restriction endonuclease(s), followed by further enzymatic modification 
if desired, isolated, and ligated in vitro. In the production of the gene encoding a 
domain, derivative or analog of MTSP, care should be taken to ensure that the 
modified gene retains the original translational reading frame, uninterrupted by 

1 5 translational stop signals, in the gene region where the desired activity is 
encoded. 

Additionally, the MTSP-encoding nucleic acid molecules can be mutated 
in vitro or in vivo, to create and/or destroy translation, initiation, and/or 
termination sequences, or to create variations in coding regions and/or form new 

20 restriction endonuclease sites or destroy pre-existing ones, to facilitate further in 
vitro modification. Also, as described herein muteins with primary sequence 
alterations, such as replacements of Cys residues and elimination or addition of 
glycosylation sites are contemplated; the MTSP25 that includes the sequence of 
amino acids set forth in SEQ ID No. 6. MTSP25 has potential glycosylation 

25 sites at N 219 AT and N 249 TS (SEQ ID No. 16). Mutations can be effected by any 
technique for mutagenesis known in the art, including, but not limited to, 
chemical mutagenesis and in vitro site-directed mutagenesis (Hutchinson et al., 
J. Biol. Chem. 253:6551-6558 (1978)), use of TAB* linkers (Pharmacia). In one 
embodiment, for example, an MTSP polypeptide or domain thereof is modified to 

30 include a fluorescent label. In other specific embodiments, the MTSP 

polypeptide is modified such that heterobifunctional reagents can be used to 
crosslink the members of a complex. 
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F. SCREENING METHODS 

The single chain protease domains and two chain forms of the protein can 
be used in a variety of methods to identify compounds that modulate the activity 
thereof. For SPs that can exhibit higher activity or expression in tumor cells, 
5 compounds that inhibit the proteolytic activity are of particular interest. For any 
SPs that are active at lower levels in tumor cells, compounds or agents that 
enhance the activity are potentially of interest. In all instances the identified 
compounds include agents that are candidate cancer treatments. 

Several types of assays are exemplified and described herein. It is 
10 understood that the protease domains can be used in other assays. It is shown 
here, however, that the single chain protease domains exhibit catalytic activity. 
As such they are ideal for in vitro screening assays. They can also be used in 
binding assays. 

The MTSP25 full length zymogens, activated enzymes, single and two 

1 5 chain protease domains are contemplated for use in any screening assay known 

to those of skill in the art, including those provided herein. Other assays, such 

as binding assays are provided herein, particularly for use with an MTSP25, 

including any variants, such as splice variants thereof. 

1 . Catalytic Assays for identification of agents that modulate the 
20 protease activity of an SP protein 

Methods for identifying a modulator of the catalytic activity of an SP, 

particularly a single chain protease domain or catalytically active portion thereof, 

are provided herein. The methods can be practiced by: contacting the MTSP25, 

a full-length zymogen or activated form, and particularly a single-chain domain 

25 thereof, with a substrate of the MTSP25 in the presence of a test substance, 
and detecting the proteolysis of the substrate, whereby the activity of the 
MTSP25 is assessed, and comparing the activity to a control. For example, a 
control can be the activity of the MTSP25 assessed by contacting an MTSP25, 
including a full-length zymogen or activated form, and particularly a single-chain 

30 domain thereof, particularly a single-chain domain thereof, with a substrate of 

the MTSP25, and detecting the proteolysis of the substrate, whereby the activity 
of the MTSP25 is assessed. The results in the presence and absence of the test 



WO 03/031585 



PCT/US02/32417 



-76- 



compounds are compared. A difference in the activity indicates that the test 
substance modulates the activity of the MTSP25. Activators of MTSP25 
activation cleavage are also contemplated; such assays are discussed below. 
In one embodiment a plurality of the test substances are screened 
5 simultaneously in the above screening method. In another embodiment, the 

MTSP25 is isolated from a target cell as a means for then identifying agents that 
are potentially specific for the target cell. 

In another embodiment, a test substance is, for example, a therapeutic 
compound. In such assays, a difference of the MTSP25 activity or in a response 
10 to MTSP25, such as signal transductions, activity measured in the presence and 
in the absence of the test substance indicates that the target cell responds to 
the therapeutic compound. 

One method includes the steps of (a) contacting the MTSP25 polypeptide 
or protease domain thereof with one or a plurality of test compounds under 
15 conditions conducive to interaction between the ligand and the compounds; and 
(b) identifying one or more compounds in the plurality that specifically binds to 
the ligand. 

Another method provided herein includes the steps of a) contacting an 
MTSP25 polypeptide or protease domain thereof with a substrate of the 

20 MTSP25 polypeptide, and detecting the proteolysis of the substrate, whereby 
the activity of the MTSP25 polypeptide is assessed; b) contacting the MTSP25 
polypeptide with a substrate of the MTSP25 polypeptide in the presence of a 
test substance, and detecting the proteolysis of the substrate, whereby the 
activity of the MTSP25 polypeptide is assessed; and c) comparing the activity of 

25 the MTSP25 polypeptide assessed in steps a) and b), whereby the activity 

measured in step a) differs from the activity measured in step b) indicates that 
the test substance modulates the activity of the MTSP25 polypeptide. 

In another embodiment, a plurality of the test substances are screened 
simultaneously. In comparing the activity of an MTSP25 polypeptide in the 
30 presence and absence of a test substance to assess whether the test substance 
is a modulator of the MTSP25 polypeptide, it is unnecessary to assay the 
activity in parallel, although such parallel measurement is typical. It is possible 
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15 



to measure the activity of the MTSP25 polypeptide at one time point and 
compare the measured activity to a historical value of the activity of the 
MTSP25 polypeptide. 

For instance, one can measure the activity of the MTSP25 polypeptide in 
the presence of a test substance and compare with historical value of the 
activity of the MTSP25 polypeptide measured previously in the absence of the 
test substance, and vice versa. This can be accomplished, for example, by 
providing the activity of the MTSP25 polypeptide on an insert or pamphlet 
provided with a kit for conducting the assay. 

Methods for selecting substrates for a particular SP are described in the 
EXAMPLES, and particular proteolytic assays are exemplified. 

Combinations and kits containing the combinations optionally including 
instructions for performing the assays are provided. The combinations include 
an MTSP25 polypeptide and a substrate of the MTSP25 polypeptide to be 
assayed; and, optionally reagents for detecting proteolysis of the substrate. 
The substrates, which can be chromogenic or fluorogenic molecules, including 
proteins, subject to proteolysis by a particular MTSP25 polypeptide, can be 
identified empirically by testing the ability of the MTSP25 polypeptide to cleave 
the test substrate. Substrates that are cleaved most effectively (i.e., at the 
20 lowest concentrations and/or fastest rate or under desirable conditions), are 
identified. 

Additionally provided herein is a kit containing the above-described 
combination. The kit optionally includes instructions for identifying a modulator 
of the activity of an MTSP25 polypeptide. Any MTSP25 polypeptide is 
25 contemplated as a target for identifying modulators of the activity thereof. 

2. Binding assays 

Also provided herein are methods for identification and isolation of 
agents, particularly compounds that bind to MTSP25s. The assays are designed 
to identify agents that bind to the zymogen form, the single chain isolated 
30 protease domain (or a protein, other than an MTSP25 polypeptide, that contains 
the protease domain of an MTSP25 polypeptide), and to the activated form, 
including the activated form derived from the full length zymogen or from an 
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extended protease domain. The identified compounds are candidates or leads 
for identification of compounds for treatments of tumors and other disorders and 
diseases involving aberrant angiogenesis. The MTSP25 polypeptides used in 
the methods include any MTSP25 polypeptide as defined herein, including the 
5 MTSP25 single chain protease domain or proteolytically active portion thereof. 
A variety of methods are provided herein. These methods can be 
performed in solution or in solid phase reactions in which the MTSP25 
polypeptide(s) or protease domain(s) thereof are linked, either directly or 
indirectly via a linker, to a solid support. Screening assays are described in the 
10 Examples, and these assays are used to identify candidate compounds. For 
purposes herein, all binding assays described above are provided for MTSP25. 

Methods for identifying an agent, such as a compound, that specifically 
binds to an MTSP25 single and/or two chain protease domain, a zymogen and/or 
full-length activated MTSP25 or two chain protease domain thereof are provided 
15 herein. The method can be practiced by (a) contacting the MTSP25 with one or 
a plurality of test agents under conditions conducive to binding between the 
MTSP25 and an agent; and (b) identifying one or more agents within the plurality 
that specifically binds to the MTSP25. 

For example, in practicing such methods the MTSP25 polypeptide is 
20 mixed with a potential binding partner or an extract or fraction of a cell under 
conditions that allow the association of potential binding partners with the 
polypeptide. After mixing, peptides, polypeptides, proteins or other molecules 
that have become associated with an MTSP25 are separated from the mixture. 
The binding partner that bound to the MTSP25 can then be removed and further 
25 analyzed. To identify and isolate a binding partner, the entire protein, for 

instance the entire polypeptide of SEQ ID No. 6 can be used. Alternatively, a 
fragment of the protein can be used. 

A variety of methods can be used to obtain cell extracts or body fluids, 
such as blood, serum, urine, sweat, synovial fluid, CSF and other such fluids. 
30 For example, cells can be disrupted using either physical or chemical disruption 
methods. Examples of physical disruption methods include, but are not limited 
to, sonication and mechanical shearing. Examples of chemical lysis methods 
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include, but are not limited to, detergent lysis and enzyme lysis. A skilled artisan 
can readily adapt methods for preparing cellular extracts in order to obtain 
extracts for use in the present methods. 

Once an extract of a cell is prepared, the extract is mixed with the 
5 MTSP25 under conditions in which association of the protein with the binding 
partner can occur. A variety of conditions can be used, including conditions that 
resemble conditions found in the cytoplasm of a human cell or in a body fluid, 
such as blood. Features, such as osmolarity, pH, temperature, and the 
concentration of cellular extract used, can be varied to optimize the association 
1 0 of the protein with the binding partner. Similarly, methods for isolation of 
molecules of interest from body fluids are known. 

After mixing under appropriate conditions, the bound complex is 
separated from the mixture. A variety of techniques can be used to separate the 
mixture. For example, antibodies specific to an MTSP25 can be used to 
1 5 immunoprecipitate the binding partner complex. Alternatively, standard chemical 
separation techniques such as chromatography and density/sediment 
centrifugation can be used. 

After removing the non-associated cellular constituents in the extract, the 
binding partner can be dissociated from the complex using conventional 
20 methods. t=or example, dissociation can be accomplished by altering the salt 
concentration or pH of the mixture. 

To aid in separating associated binding partner pairs from the mixed 
extract, the MTSP25 can be immobilized on a solid support. For example, the 
protein can be attached to a nitrocellulose matrix or acrylic beads. Attachment 
25 of the protein or a fragment thereof to a solid support aids in separating 

peptide/binding partner pairs from other constituents found in the extract. The 
identified binding partners can be either a single protein or a complex made up of 
two or more proteins. 

Alternatively, the nucleic acid molecules encoding the single chain 
30 proteases can be used in a yeast two-hybrid system. The yeast two-hybrid 

system has been used to identify other protein partner pairs and can readily be 
adapted to employ the nucleic acid molecules herein described. 
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Another in vitro binding assay, particularly for an MTSP25, uses a 
mixture of a polypeptide that contains at least the catalytic domain of one of 
these proteins and one or more candidate binding targets or substrates. After 
incubating the mixture under appropriate conditions, the ability of the MTSP25 
5 or a polypeptide fragment thereof containing the catalytic domain to bind to or 
interact with the candidate substrate is assessed. For cell-free binding assays, 
one of the components includes or is coupled to a detectable label. The label 
can provide for direct detection, such as radioactivity, luminescence, optical or 
electron density, etc., or indirect detection such as an epitope tag, an enzyme, 

10 etc. A variety of methods can be employed to detect the label depending on the 
nature of the label and other assay components. For example, the label can be 
detected bound to the solid substrate or a portion of the bound complex 
containing the label can be separated from the solid substrate, and the label 
thereafter detected. 

15 3. Detection of signal transduction 

MTSP25, which is a transmembrane protein, can be involved directly or 
indirectly in signal transduction directly as a cell surface receptor or indirectly by 
activating proteins, such as pro-growth factors that can initiate signal 
transduction. 

20 In addition, secretion of MTSP25, such as the extracellular domain of 

MTSP25, can be involved in signal transduction either directly by binding to or 
interacting with a cell surface receptor or indirectly by activating proteins, such 
as pro-growth factors that can initiate signal transduction. Assays for assessing 
signal transduction are well known to those of skill in the art, and can be 

25 adapted for use with the MTSP25 polypeptide. 

Assays for identifying agents that affect or alter signal transduction 
mediated directly or indirectly, such as via activation of a pro-growth factor, by 
an MTSP25, particularly the full length or a sufficient portion to anchor the 
extracellular domain or a functional portion thereof of an MTSP25 on the surface 

30 of a cell are provided. Such assays, include, for example, transcription based 
assays in which modulation of a transduced signal is assessed by detecting an 
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effect on an expression from a reporter gene (see, e.g., U.S. Patent No. 
5,436,128). 

4. Methods for Identifying Agents that Modulate the Expression a 
Nucleic Acid Encoding an MTSP25 

5 Another embodiment provides methods for identifying agents that 

modulate the expression of a nucleic acid encoding an MTSP25. Such assays 

use any available means of monitoring for changes in the expression level of the 

nucleic acids encoding an MTSP25. 

In one assay format, cell lines that contain reporter gene fusions between 

10 the open reading frame of MTSP25 or a domain thereof, particularly the protease 
domain and any assayable fusion partner can be prepared. Numerous assayable 
fusion partners are known and readily available including the firefly luciferase 
gene and the gene encoding chloramphenicol acetyltransferase (Alam et a!., 
Anal. Biochem, 188: 245-54 (1990)). Cell lines containing the reporter gene 

1 5 fusions are then exposed to the agent to be tested under appropriate conditions 
and time. Differential expression of the reporter gene between samples exposed 
to the agent and control samples identifies agents which modulate the 
expression of a nucleic acid encoding an MTSP25. 

Additional assay formats can be used to monitor the ability of the agent 

20 to modulate the expression of a nucleic acid encoding an MTSP25. For 

instance, mRNA expression can be monitored directly by hybridization to the 
nucleic acids. Cell lines are exposed to the agent to be tested under appropriate 
conditions and time and total RNA or mRNA is isolated by standard procedures 
(see, e.g., Sambrook etal. (1989) MOLECULAR CLONING: A LABORATORY 

25 MANUAL, 2nd Ed. Cold Spring Harbor Laboratory Press). Probes to detect 
differences in RNA expression levels between cells exposed to the agent and 
control cells can be prepared from the nucleic acids. It is typical, but not 
necessary, to design probes which hybridize only with target nucleic acids under 
conditions of high stringency. Only highly complementary nucleic acid hybrids 

30 form under conditions of high stringency. Accordingly, the stringency of the 

assay conditions determines the amount of complementarity which should exist 
between two nucleic acid strands in order to form a hybrid. Stringency should 
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be chosen to maximize the difference in stability between the probertarget hybrid 
and potential probe:non-target hybrids. 

Probes can be designed from the nucleic acids through methods known in 
the art. For instance, the G + C content of the probe and the probe length can 
5 affect probe binding to its target sequence. Methods to optimize probe 
specificity are commonly available (see, e.g., Sambrook etal. (1989) 
MOLECULAR CLONING: A LABORATORY MANUAL, 2nd Ed. Cold Spring 
Harbor Laboratory Press); and Ausubel etal. (1995) CURRENT PROTOCOLS IN 
MOLECULAR BIOLOGY, Greene Publishing Co., NY). 

10 Hybridization conditions are modified using known methods (see, e.g., 

Sambrook etal. (1989) MOLECULAR CLONING: A LABORATORY MANUAL, 
2nd Ed. Cold Spring Harbor Laboratory Press); and Ausubel et al. (1995) 
CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, Greene Publishing Co., NY), 
as required for each probe. Hybridization of total cellular RNA or RNA enriched 

1 5 for polyA RNA can be accomplished in any available format. For instance, total 
cellular RNA or RNA enriched for polyA RNA can be affixed to a solid support, 
and the solid support exposed to at least one probe comprising at least one, or 
part of one of the nucleic acid molecules under conditions in which the probe 
specifically hybridizes. Alternatively, nucleic acid fragments comprising at least 

20 one, or part of one of the sequences can be affixed to a solid support, such as a 
porous glass wafer. The glass wafer can then be exposed to total cellular RNA 
or polyA RNA from a sample under conditions in which the affixed sequences 
specifically hybridize. Such glass wafers and hybridization methods are widely 
available, for example, those disclosed by Beattie (WO 95/1 1755). By 

25 examining for the ability of a given probe to specifically hybridize to an RNA 

sample from an untreated cell population and from a cell population exposed to 
the agent, agents which up or down regulate the expression of a nucleic acid 
encoding the MTSP25 polypeptide, are identified. 
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In one format, the relative amounts of a protein between a cell population 
that has been exposed to the agent to be tested compared to an un-exposed 
control cell population can be assayed {e.g., a prostate cancer cell line, a lung 
cancer cell line, a colon cancer cell line or a breast cancer cell line). In this 
5 format, probes, such as specific antibodies, are used to monitor the differential 
expression or level of activity of the protein in the different cell populations or 
body fluids. Cell lines or populations or body fluids are exposed to the agent to 
be tested under appropriate conditions and time. Cellular lysates or body fluids 
can be prepared from the exposed cell line or population and a control, 
10 unexposed cell line or population or unexposed body fluid. The cellular lysates 
or body fluids are then analyzed with the probe. 

For example, N- and C- terminal fragments of the MTSP25 can be 
expressed in bacteria and used to search for proteins which bind to these 
fragments. Fusion proteins, such as His-tag or GST fusion to the N- or C- 
15 terminal regions of the MTSP25 can be prepared for use as a substrate. These 
fusion proteins can be coupled to, for example, Glutathione-Sepharose beads and 
then probed with cell lysates or body fluids. Prior to lysis, the cells or body 
fluids can be treated with a candidate agent which can modulate an MTSP25 or 
proteins that interact with domains thereon. Lysate proteins binding to the 
20 fusion proteins can be resolved by SDS-PAGE, isolated and identified by protein 
sequencing or mass spectroscopy, as is known in the art. 

Antibody probes are prepared by immunizing suitable mammalian hosts in 
appropriate immunization protocols using the peptides, polypeptides or proteins 
if they are of sufficient length [e.g., 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 
25 25, 30, 35, 40 or more consecutive amino acids the MTSP25 polypeptide or if 
required to enhance immunogenicity, conjugated to suitable carriers. Methods 
for preparing immunogenic conjugates with carriers, such as bovine serum 
albumin (BSA), keyhole limpet hemocyanin (KLH), or other carrier proteins are 
well known in the art. In some circumstances, direct conjugation using, for 
30 example, carbodiimide reagents can be effective; in other instances linking 

reagents such as those supplied by Pierce Chemical Co., Rockford, IL, can be 
desirable to provide accessibility to the hapten. Hapten peptides can be 
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extended at either the amino or carboxy terminus with a Cys residue or 
interspersed with cysteine residues, for example, to facilitate linking to a carrier. 
Administration of the immunogens is conducted generally by injection over a 
suitable time period and with use of suitable adjuvants, as is generally 
5 understood in the art. During the immunization schedule, titers of antibodies are 
taken to determine adequacy of antibody formation. 

Anti-peptide antibodies can be generated using synthetic peptides 
corresponding to, for example, the carboxy terminal amino acids of the MTSP25. 
Synthetic peptides can be as small as 1-3 amino acids in length, generally at 
10 least 4 or more amino acid residues long. The peptides can be coupled to KLH 
using standard methods and can be immunized into animals, such as rabbits or 
ungulates. Polyclonal antibodies can then be purified, for example using Actigel 
beads containing the covalently bound peptide. 

While the polyclonal antisera produced in this way can be satisfactory for 

15 some applications, for pharmaceutical compositions, use of monoclonal 
preparations are generally used. Immortalized cell lines which secrete the 
desired monoclonal antibodies can be prepared using the standard method of 
Kohler et aL, (Nature 256: 495-7 (1975)) or modifications which effect 
immortalization of lymphocytes or spleen cells, as is generally known. The 

20 immortalized eel! lines secreting the desired antibodies are screened by 

immunoassay in which the antigen is the peptide hapten, polypeptide or protein. 
When the appropriate immortalized cell culture secreting the desired antibody is 
identified, the cells can be cultured either in vitro or by production in vivo via 
ascites fluid. Of particular interest, are monoclonal antibodies that recognize the 

25 catalytic domain or activation cleavage site (region) of an MTSP25. 

Additionally, the zymogen or two-chain form of the MTSP25 can be used 
to make monoclonal antibodies that recognize conformation epitopes. The 
desired monoclonal antibodies are then recovered from the culture supernatant 
or from the ascites supernatant. Fragments of the monoclonals or the polyclonal 

30 antisera which contain the immunologically significant portion can be used as 
antagonists, as well as the intact antibodies. Immunologically reactive 
fragments, such as the Fab, Fab', or F(ab') 2 fragments are often used, especially 
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in a therapeutic context, as these fragments are generally less immunogenic than 
the whole immunoglobulin. 

The antibodies or fragments can also be produced. Regions that bind 
specifically to the desired regions of receptor also can be produced in the 
5 context of chimeras with multiple species origin. 

Agents that are assayed in the above method can be randomly selected 
or rationally selected or designed. 

The agents can be, as examples, peptides, small molecules, and 
carbohydrates. A skilled artisan can readily recognize that there is no limit as to 
10 the structural nature of the agents. 

The peptide agents can be prepared using standard solid phase (or 
solution phase) peptide synthesis methods, as is known in the art. In addition, 
the DNA encoding these peptides can be synthesized using commercially 
available oligonucleotide synthesis instrumentation and produced recombinantly 
1 5 using standard recombinant production systems. The production using solid 

phase peptide synthesis is necessitated if non-gene-encoded amino acids are to 
be included. 

G. Assay formats and selection of test substances that modulate at least 
one activity of an MTSP25 polypeptide 

20 Methods for identifying agents that modulate at least one activity of an 

MTSP25 are provided. The methods include phage display and other methods 
for assessing alterations in the activity of an MTSP25. Such methods or assays 
can use any means of monitoring or detecting the desired activity. A variety of 
formats and detection protocols are known for performing screening assays. 

25 Any such formats and protocols can be adapted for identifying modulators of 

MTSP25 polypeptide activities. The following includes a discussion of exemplary 
protocols. 

1 . High throughput screening assays 

Although the above-described assay can be conducted where a single 
30 MTSP25 polypeptide is screened, and/or a single test substance is screened in 
one assay, the assay typically is conducted in a high throughput screening 
mode, i.e., a plurality of the SP proteins are screened against and/or a plurality 
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of the test substances are screened simultaneously {See generally, High 
Throughput Screening: The Discovery of Bioactive Substances (Devlin, Ed.) 
Marcel Dekker, 1997; Sittampalam et al., Cum Opin. Chem. Biol. , /:384-91 
(1997); and Silverman et al., Curr. Opin. Chem. Biol., 2:397-403 (1998)). For 
5 example, the assay can be conducted in a multi-well {e.g., 24-, 48-, 96-, 384-, 
1 536-well or higher density), chip or array format. 

High-throughput screening (HTS) is the process of testing a large number 
of diverse chemical structures against disease targets to identify "hits" 
(Sittampalam et al., Curr. Opin. Chem. Biol., 7:384-91 (1997)). Current state-of- 
10 the-art HTS operations are highly automated and computerized to handle sample 
preparation, assay procedures and the subsequent processing of large volumes 
of data. 

Detection technologies employed in high-throughput screens depend on 
the type of biochemical pathway being investigated (Sittampalam et al., Curr. 

15 Opin. Chem. Biol., 7:384-91 (1997)). These methods include, radiochemical 

methods, such as the scintillation proximity assays (SPA), which can be adapted 
to a variety of enzyme assays (Lerner et al., J. Biomol. Screening, 1: 135-143 
(1996); Baker et al., Anal. Biochem. t 239:20-24 (1996); Baum et al., Anal. 
Biochem., 237:129-134 (1996); and Sullivan et al., J. Biomol. Screening 2:19- 

20 23 (1997)) and protein-protein interaction assays (Braunwalder et al., J. Biomol. 
Screening /:23-26 (1996); Sonatore et al.. Anal. Biochem. 240:289-297 (1996); 
and Chen et al,, J. Biol. Chem. 277:25308-25315 (1996)), and non-isotopic 
detection methods, including but are not limited to, colorimetric and 
luminescence detection methods, resonance energy transfer (RET) methods, 

25 time-resolved fluorescence (HTRF) methods, cell-based fluorescence assays, 
such as fluorescence resonance energy transfer (FRET) procedures (see, 
e.g., Gonzalez et al., Biophys. J., 65:1272-1280 (1995)), fluorescence 
polarization or anisotropy methods (see, e.g., Jameson et al., Methods Enzymoi. 
246:283-300 (1995); Jolley, J. Biomol. Screening 7:33-38 (1996); Lynch et al., 

30 Anal. Biochem. 247:77-82 (1997)), fluorescence correlation spectroscopy (FCS) 
and other such methods. 
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2. Test Substances 
Test compounds, including small molecules, antibodies, proteins, nucleic 
acids, peptides, natural products, extracts containing natural products and 
libraries and collections thereof, can be screened in the above-described assays 
5 and assays described below to identify compounds that modulate the activity of 
an MTSP25 polypeptide. Rational drug design methodologies that rely on 
computational chemistry can be used to screen and identify candidate 
compounds. 

The compounds identified by the screening methods include inhibitors, 
10 including antagonists, and can be agonists. Compounds for screening include 
any compounds and collections of compounds available, known or that can be 
prepared. 

a. Selection of Compounds 

Compounds can be selected for their potency and selectivity of inhibition 
15 of serine proteases, especially an MTSP25 polypeptide. As described herein, 
and as generally known, a target serine protease and its substrate are combined 
under assay conditions permitting reaction of the protease with its substrate. 
The assay is performed in the absence of test compound, and in the presence of 
increasing concentrations of the test compound. The concentration of test 
20 compound at which 50% of the serine protease activity is inhibited by the test 
compound is the IC 50 value (Inhibitory Concentration) or EC 50 (Effective 
Concentration) value for that compound. Within a series or group of test 
compounds, those having lower IC 50 or EC 50 values are considered more potent 
inhibitors of the serine protease than those compounds having higher IC 50 or 
25 EC 50 values. The IC 50 measurement is often used for more simplistic assays, 
whereas the EC 50 is often used for more complicated assays, such as those 
employing cells. 

Typically candidate compounds have an IC 50 value of 100 nM or less as 
measured in an in vitro assay for inhibition of MTSP25 polypeptide activity. The 
30 test compounds also are evaluated for selectivity toward a serine protease. As 
described herein, and as generally known, a test compound is assayed for its 
potency toward a panel of serine proteases and other enzymes and an IC S0 value 
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10 



15 



or EC 60 value is determined for each test compound in each assay system. A 
compound that demonstrates a low IC so value or EC S0 value for the target 
enzyme, e.g., MTSP25 polypeptide, and a higher IC 50 value or EC M value for 
other enzymes within the test panel [e.g., urokinase tissue plasminogen 
activator, thrombin, Factor Xa), is considered to be selective toward the target 
enzyme. Generally, a compound is deemed selective if its IC 50 value or EC 50 
value in the target enzyme assay is at least one order of magnitude less than the 
next smallest IC 50 value or EC S0 value measured in the selectivity panel of 
enzymes. 

Compounds are also evaluated for their activity in vivo. The type of 
assay chosen for evaluation of test compounds depends on the pathological 
condition to be treated or prevented by use of the compound, as well as the 
route of administration to be evaluated for the test compound. 

For instance, to evaluate the activity of a compound to reduce tumor 
growth through inhibition of MTSP25 polypeptide, the procedures described by 
Jankun et al.. Cane. Res. 57:559-563 (1997) to evaluate PAI-1 can be 
employed. Briefly, the ATCC cell lines DU145 and LnCaP are injected into SCID 
mice. After tumors are established, the mice are given test compound according 
to a dosing regime determined from the compound's in vitro characteristics. The 
20 Jankun et al. compound was administered in water. Tumor volume 

measurements are taken twice a week for about five weeks. A compound is 
deemed active if an animal to which the compound was administered exhibited 
decreased tumor volume, as compared to animals receiving appropriate control 
compounds. 

25 Another in vivo experimental model designed to evaluate the effect of p- 

aminobenzamidine, a swine protease inhibitor, on reducing tumor volume is 
described by Billstrdm et al.. Int. J. Cancer 57:542-547 (1995). 

To evaluate the ability of a compound to reduce the occurrence of, or 
inhibit, metastasis, the procedures described by Kobayashi et al. Int. J. Cane. 
57:727-733d (1994) can be employed. Briefly, a murine xenograft selected for 
high lung colonization potential in injected into C57B1/6 mice i.v. (experimental 
metastasis) or s.c. into the abdominal wall (spontaneous metastasis). Various 
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concentrations of the compound to be tested can be admixed with the tumor 
cells in Matrigel prior to injection. Daily i.p. injections of the test compound are 
made either on days 1-6 or days 7-13 after tumor inoculation. The animals are 
sacrificed about three or four weeks after tumor inoculation, and the lung tumor 
5 colonies are counted. Evaluation of the resulting data permits a determination as 
to efficacy of the test compound, optimal dosing and route of administration. 

The activity of the tested compounds toward decreasing tumor volume 
and metastasis can be evaluated in model described in Rabbani et al., Int. J. 
Cancer 65:840-845 (1995) to evaluate their inhibitor. There, Mat LyLu tumor 

10 cells were injected into the flank of Copenhagen rats. The animals were 

implanted with osmotic minipumps to continuously administer various doses of 
test compound for up to three weeks. The tumor mass and volume of 
experimental and control animals were evaluated during the experiment, as were 
metastatic growths. Evaluation of the resulting data permits a determination as 

15 to efficacy of the test compound, optimal dosing, and route of administration. 
Some of these authors described a related protocol in Xing et al., Cane. Res. 
£7:3585-3593 (1997). 

To evaluate the anti-angiogenesis activity of a compound, a rabbit cornea 
neovascularization model can be employed (see, e.g., Avery et ah (1990) Arch. 
20 Ophthalmol., 108:1474-147). Avery et al. describes anesthetizing New Zealand 
albino rabbits and then making a central corneal incision and forming a radial 
corneal pocket. A slow release prostaglandin pellet was placed in the pocket to 
induce neovascularization. Test compound was administered i.p. for five days, 
at which time the animals were sacrificed. The effect of the test compound is 
25 evaluated by review of periodic photographs taken of the limbus, which can be 
used to calculate the area of neovascutar response and, therefore, limbal 
neovascularization. A decreased area of neovascularization as compared with 
appropriate controls indicates the test compound was effective at decreasing or 
inhibiting neovascularization. 

An angiogenesis model used to evaluate the effect of a test compound in 
preventing angiogenesis is described by Min et al. Cane. Res. S6':2428-2433 
(1996). C57BL6 mice receive subcutaneous injections of a Matrigel mixture 
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containing bFGF, as the angiogenesis-inducing agent, with and without the test 
compound. After five days, the animals are sacrificed and the Matrigel plugs, in 
which neovascularization can be visualized, are photographed. An experimental 
animal receiving Matrigel and an effective dose of test compound exhibits less 
5 vascularization than a control animal or an experimental animal receiving a less- 
or non-effective does of compound. 

An in vivo system designed to test compounds for their ability to limit the 
spread of primary tumors is described by Crowley et aL, Proc. Natl. Acad. ScL 
<90:5O2 1-5025 (1993). Nude mice are injected with tumor cells (PC3) 
10 engineered to express CAT {chloramphenicol acetyltransf erase). Compounds to 
be tested for their ability to decrease tumor size and/or metastases are 
administered to the animals, and subsequent measurements of tumor size and/or 
metastatic growths are made. In addition, the level of CAT detected in various 
organs provides an indication of the ability of the test compound to inhibit 
15 metastasis; detection of less CAT in tissues of a treated animal versus a control 
animal indicates less CAT-expressing cells migrated to that tissue. 

in vivo experimental models designed to evaluate the inhibitory potential 
of a test serine protease inhibitors, using a tumor cell line F3II known to be 
highly invasive (see, e.g., Alonso et ai. (1996) Breast Cane. Res. Treat. 40:209- 
20 223) are provided. Alonso describes in vivo studies for toxicity determination, 
tumor growth, invasiveness, spontaneous metastasis, experimental lung 
metastasis, and an angiogenesis assay. 

The CAM model (chick embryo chorioallantoic membrane model; 
Ossowski (1988) J. CellBioL 707:2437-2445) , provides another method for 
25 evaluating the inhibitory activity of a test compound. In the CAM model, tumor 
cells invade through the chorioallantoic membrane containing CAM (with tumor 
cells in the presence of several serine protease inhibitors results in less or no 
invasion of the tumor cells through the membrane). Thus, the CAM assay is 
performed with CAM and tumor cells in the presence and absence of various 
30 concentrations of test compound. The invasiveness of tumor cells is measured 
under such conditions to provide an indication of the compound's inhibitory 
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activity. A compound having inhibitory activity correlates with less tumor 
invasion. 

The CAM model is also used in a standard assay of angiogenesis (i.e., 
effect on formation of new blood vessels (Brooks et aL Methods in Molecular 
5 Biology 729:267-269 (1999)). According to this model, a filter disc containing 
an angiogenesis inducer, such as basic fibroblast growth factor (bFGF) is placed 
onto the CAM. Diffusion of the cytokine into the CAM induces local 
angiogenesis, which can be measured in several ways such as by counting the 
number of blood vessel branch points within the CAM directly below the filter 

10 disc. The ability of identified compounds to inhibit cytokine-induced 

angiogenesis can be tested using this model. A test compound can either be 
added to the filter disc that contains the angiogenesis inducer, be placed directly 
on the membrane or be administered systemically. The extent of new blood 
vessel formation in the presence and/or absence of test compound can be 

15 compared using this model. The formation of fewer new blood vessels in the 
presence of a test compound would be indicative of anti-angiogenesis activity. 
Demonstration of anti-angiogenesis activity for inhibitors of an MTSP25 
polypeptide indicates a role in angiogenesis for that SP protein. 

b. Known serine protease inhibitors 

20 Compounds for screening can be serine protease inhibitors, which can be 

tested for their ability to inhibit the activity of an MTSP25. 
Exemplary, serine protease inhibitors for use in the screening assays, include, 
but are not limited to: Serine Protease Inhibitor 3 (SPI-3) (Chen, et aL Citokine, 
1 7:856-862 (1999)); Aprotinin (lijima, FL, et aL, J. Biochem. (Tokyo) 726:912- 

25 916 (1999)); Kazal-type serine protease inhibitor-like proteins (Niimi, et aL Eur. 
J. Biochem., 266:282-292 (1999)); Kunitz-type serine protease inhibitor 
(Ravichandran, S., et al., Acta Crystallogr. D. BioL Crystallogr., 55:1814-1821 
(1999)); Tissue factor pathway inhibitor-2/Matrix-associated serine protease 
inhibitor (TFPI-2/MSPI), (Liu, Y. et aL Arch. Biochem. Biophys. 370:112-8 

30 (1999)); Bukunin (Cui, C.Y. et aL J. Invest. Dermatol. / 73:1 82-8 (1 999)); 

Nafmostat mesilate (Ryo, R. et aL Vox Sang. 76:241-6 (1999)); TPCK (Huang 
et aL Oncogene 75:3431-3439 (1999)); A synthetic cotton-bound serine 
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protease inhibitor (Edwards (1999) eta/. Wound Repair Regen. 7:106-18); FUT- 
175 (Sawada (1999) etal. Stroke 30:644-50); Combination of serine protease 
inhibitor FUT-0175 and thromboxane synthetase inhibitor OKY-046 (Kaminogo et 
aL (1998) Neurol. Med. Chir. (Tokyo) 35:704-8; discussion 708-9); the rat 
5 serine protease inhibitor 2.1 gene (LeCam, A., et aL, Biochem. Biophys. Res. 
Commun., 253:31 1-4 (1998)); A new intracellular serine protease inhibitor 
expressed in the rat pituitary gland complexes with granzyme B (Hill et a/. FEBS 
Lett. 440:361-4 (1998)); 3,4-Dichloroisocoumarin (Hammed et aL Proc. Soc. 
Exp. Biol. Med., 273:132-7 (1998)); LEX032 (Bains etal. Eur. J. Pharmacol. 

10 356:67-72 (1998)); N-tosyl-L-phenylalanine chloromethyl ketone (Dryjanski etal. 
Biochemistry 37:14151-6 (1998)); Mouse gene for the serine protease inhibitor 
neuroserpin (P1 1 2) (Berger etal. Gene, 274:25-33 (1998)); Rat serine protease 
inhibitor 2.3 gene (Paul etal. Eur. J. Biochem. 254:538-46 (1998)); Ecotin 
(Yang etal. J. Mot. Biol. 275:945-57 (1998)); A 14 kDa plant-related serine 

15 protease inhibitor (Roch etal. Dev. Comp. Immunol. 22(1):1-12 (1998)); Matrix- 
associated serine protease inhibitor TFPI-2/33 kDa MSPl (Rao etal. Int. J. 
Cancer 75:749-56 (1998)); ONO-3403 (Hiwasa etal. Cancer Lett. 725:221-5 
(1998)); Bdellastasin (Moser etal. Eur. J. Biochem. 253:212-20 (1998)); Bikunin 
(Xu etal. J. MoL Biol. 275:955-66 (1998)); Nafamostat mesilate.(Mellgren etal. 

20 Thromb. Haemost. 79:342-7 (1998)); The growth hormone dependent serine 
protease inhibitor, Spi 2.1 (Maake etal. Endocrinology 738:5630-6 (1997)); 
Growth factor activator inhibitor type 2, a Kunitz-type serine protease inhibitor 
(Kawaguchi et aL J. Biol. Chem., 272:27558-64 (1997)); Heat-stable serine 
protease inhibitor protein from ovaries of the desert locust, Schistocerga gregaria 

25 (Hamdaoui etal. Biochem. Biophys. Res. Commun. 233:357-60 (1997)); Human 
placental Hepatocyte growth factor activator inhibitor, a Kunitz-type serine 
protease inhibitor (Shimomura etal. J. Biol. Chem. 272:6370-6 (1997)); FUT- 
187, oral serine protease inhibitor (Shiozaki etal. Gan To Kaguku Ryoho, 23(14): 
1971-9 (1996)); Extracellular matrix-associated serine protease inhibitors (Mr 

30 33,000, 31 ,000, and 27,000 (Rao, C.N., et al., Arch. Biochem. Biophys., 
335:82-92 (1996)); An irreversible isocoumarin serine protease inhibitor 
(Palencia, D.D., etal., Biol. Reprod., 55:536-42 (1996)); 4-(2-arninoethyl)- 
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benzenesulfonyi fluoride (AEBSF) (Nakabo et al. J. Leukoc. Biol. 50:328-36 
(1996)); Neuroserpin (Osterwalder, T., et al., EMBO J. 75:2944-53 (1996)); 
Human serine protease inhibitor alpha-1 -antitrypsin (Forney et al. J. Parasitol.. 
52:496-502 (1996)); Rat serine protease inhibitor 2.3 (Simar-Blanchet, A.E., et 
5 ai., Eur. J. Biochem., 235:638-48 (1996)); Gebaxate mesilate (parodi, F., et a!., 
J. Cardiothorac. Vase. Anesth. 70:235-7 (1 996)); Recombinant serine protease 
inhibitor, CPTI il (Stankiewicz, M., et al., (Acta Biochim. Pol., 43(3) :525-9 
(1996)); A cysteine-rich serine protease inhibitor (Guamerin II) (Kim, D.R., et al., 
J. Enzym. Inhib., /0:81-91 (1 996)); Diisopropylfluorophosphate (Lundqvist, H., 

10 et al., Inflamm. Res., 44(12):510-7 (1995)); Nexin 1 (Yu, D.W., et al., J. Cell 

ScL, 108(Pt 12):3867-74 (1995)); LEX032 (Scalia, R., et al., Shock, 4(4):251-6 
(1995)); Protease nexin I (Houenou, L.J., et al., Proc. Natl. Acad. ScL U.S.A., 
92131:895-9 (1995)); Chymase-directed serine protease inhibitor (Woodard S.L., 
et al., J. Immunol., 1 53(1 1 ):501 6-25 (1994)); N-alpha-tosyl-L-lysyl-chloromethyl 

15 ketone (TLCK) (Bourinbaiar, A.S., et al., Cell Immunol., 155(1) :230-6 (1994)); 

Smpi56 (Ghendler, Y., et al., Exp. Parasitol., 78(2) :121-31 (1994)); Schistosoma 
haematobium serine protease (Blanton, R.E., et al., Mol. Biochem. Parasitol., 
■ 63111:1-1 1 (1994)); Spi-1 (Warren, W.C., et al., Mol. Cell Endocrinol., 98(1 ) :27- 



32 (1993)); TAME (Jessop, J.J., et a!., Inflammation, 17(5) :613-31 (1993)), 
20 Antithrombin III (Kalaria, R.N., et al., Am. J. Pathol. , 1 43(3) :886-93 (1993)) 

FOY-305 (Ohkoshi, M., et al., Anticancer Res., 13141:963-6 (1993)); Camostat 
mesilate (Senda, S., et al., Intern. Med., 32(41 :350-4 (1993)); Pigment 
epithelium-derived factor (Steele, F.R., et al., Proc. Natl. Acad. ScL U.S.A., 
90141:1526-30 (1993)); Antistasin (Holstein, T.W., et al., FEBS Lett., 
25 309131:288-92 (1992)); The vaccinia virus K2L gene encodes a serine protease 
inhibitor (Zhou, J., et al., Virology, 1 89(2) :678-86 (1992)); Bowman-Birk serine- 
protease inhibitor (Werner, M.H., et al., J. Mol. Biol., 225(3) :873-89 (1992); 
FUT-175 (Yanamoto, H., et al., Neurosurgery, 30131:358-63 (1992)); FUT-175; 
(Yanamoto, H„ et al., Neurosurgery, 30(3) :351-6. discussion 356-7 (1992)); 
30 PAI-I (Yreadwell, B.V., et al., J. Orthop. Res., £131:309-16 (1991)); 3,4- 

Dichloroisocoumarin (Rusbridge, N.M., et al., FEBS Lett., 26801:1 33-6 (1990)); 
Alpha 1-antichymotrypsin (Lindmark, B.E., et ah, Am. Rev. Respir. Des., 141(4 
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£Lll:884-8 (1990)); P-toluenesulfonyl-L-arginine methyl ester (TAME) (Scuderi, 
P., J. Immunol.. 143(1):1 68-73 (1989)); Alpha 1-antichymotrypsin (Abraham, 
C.R., et al.. Cell, 52(41 :487-501 (1988)); Contrapsin (Modha, J., et al., 
Parasitology, 96 (Pt 1):99-109 (1988)); Alpha 2-antiplasmin (Holmes, W.E., et 
5 al., J. Biol. Chem., 262(4):1 659-64 (1987)); 3,4-dichloroisocoumarin (Harper, 
J.W., et al., Biochemistry, 24(8): 1 831-41 (1985)); Diisopropylfluorophosphate 
(Tsutsui, K„ et al., Biochem. Biophys. Res. Commun., 123(1 ) :271-7 (1984)); 
Gabexate mesilate (Hesse, B., et al., Pharmacol. Res. Commun., 16(7) :637-45 
(1984)); Phenyl methyl sulfonyl fluoride (Dufer, J., et al., Scand. J. Haematol., 

0 3211j:25-32 (1984)); Protease inhibitor CI-2 (McPhalen, C.A., et al., J. Mol. 
Biol, 168(21:445-7 (1 983)); Phenylmethylsutfonyl fluoride (Sekar V., et a!., 
Biochem. Biophys. Res. Commun., 89(21 :474-8 (1979)); PGE1 (Feinstein, M.D., 
et al., Prostaglandins, 1 4(6) :1 075-93 (1977). 

c. Combinatorial libraries and other libraries 
5 The source of compounds for the screening assays, can be libraries, 

including, but are not limited to, combinatorial libraries. Methods for 
synthesizing combinatorial libraries and characteristics of such combinatorial 
libraries are known in the art (See generally. Combinatorial Libraries: Synthesis, 
Screening and Application Potential (Cortese Ed.) Walter de Gruyter, Inc., 1995; 
D Tietze and Lieb, Cum Opin. Chem. Biol., 2(31:363-71 (1998); Lam, Anticancer 
Drug Des., 12(3):145-67 (1997); Blaney and Martin, Curr. Opin. Chem. Biol, 
1111:54-9 (1997); and Schultz and Schultz, Biotechnol Prog., l_2j61:729-43 
(1996)). 

Methods and strategies for generating diverse libraries, primarily peptide- 

1 and nucleotide-based oligomer libraries, have been developed using molecular 
biology methods and/or simultaneous chemical synthesis methodologies [see, 
e.g.. Dower et al., Annu. Rep. Med. Chem., 26:271-280 (1991); Fodor et al., 
Science, 251:767-773 (1991); Jung et al., Angew. Chem. Ind. Ed. Engl, 
31:367-383 (1992); Zuckerman et al., Proc. Natl Acad. ScL USA, 89:4505- 
4509 (1992); Scott et al., Science, 249:386-390 (1990); Devlin et al., Science, 
249:404-406 (1990); Cwirla et al., Proc. Natl Acad. Sci. USA, 87:6378-6382 
(1990); and Gallop et al., J. Medicinal Chemistry, 37:1233-1251 (1994)). The 
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resulting combinatorial libraries potentially contain millions of compounds that 
can be screened to identify compounds that exhibit a selected activity. 

The libraries fall into roughly three categories: fusion-protein-displayed 
peptide libraries in which random peptides or proteins are presented on the 
5 surface of phage particles or proteins expressed from plasmids; support-bound 
synthetic chemical libraries in which individual compounds or mixtures of 
compounds are presented on insoluble matrices, such as resin beads (see, e.g., 
Lam et aL, Nature, 354 :82-84 (1991)) and cotton supports (see, e.g., Eichler et 
al., Biochemistry 32:11035-11041 (1993)); and methods in which the 
10 compounds are used in solution (see, e.g., Houghten et al., Nature, 354 :84-86 
(1991); Houghten et al., BioTechniques, 313 :412-421 (1992); and Scott et al., 
Curr. Opin. BiotechnoL, 5:40-48 (1994)). There are numerous examples of 
synthetic peptide and oligonucleotide combinatorial libraries and there are many 
methods for producing libraries that contain non-peptidic small organic mole- 
1 5 cules. Such libraries can be based on a basis set of monomers that are 

combined to form mixtures of diverse organic molecules or that can be combined 
to form a library based upon a selected pharmacophore monomer. 

Either a random or a deterministic combinatorial library can be screened 
by the presently disclosed and/or claimed screening methods. In either of these 
20 two libraries, each unit of the library is isolated and/or immobilized on a solid 
support. In the deterministic library, one knows a priori a particular unit's 
location on each solid support. In a random library, the location of a particular 
unit is not known a priori although each site still contains a single unique unit. 
Many methods for preparing libraries are known to those of skill in this art (see, 
25 e.g., Geysen et a!., Proc. Nat/. Acad. Sci. USA, 81:3998-4002 (1984), 
Houghten et al., Proc. NatL Acad. Sci. USA, 81:5131-5135 (1985)). 
Combinatorial library generated by the any techniques known to those of skill in 
the art are contemplated (see, e.g., Table 1 of Schultz and Schultz, Biotechnoi. 
Prog., 12(6) :729-43 (1996)) for screening; Bartel eta/., Science, 261 :1411- 
30 1418 (1993); Baumbach et aL BioPharm, (Can) :24-35 (1992); Bock et al. 
Nature, 355 :564-566 (1992); Borman, S., Combinatorial chemists focus on 
small molecules molecular recognition, and automation, Chem. Eng. News, 



WO 03/031585 



PCT/US02/32417 



-96- 

2(121:29 (1996); Boublik, et al., Eukaryotic Virus Display: Engineering the Major 
Surface Glycoproteins of the Autographa California Nuclear Polyhedrosis Virus 
(ACNPV) for the Presentation of Foreign Proteins on the Virus Surface, 
Bio/Technology, 13:1079-1084 (1995); Brenner, et al.. Encoded Combinatorial 
5 Chemistry, Proc. Natl. Acad Sci. U.S.A., 89:5381 -5383 (1 992); Caflisch, et al., 
Computational Combinatorial Chemistry for De Novo Ligand Design: Review and 
Assessment, Perspect. Drug Discovery Des., 3:51-84 (1995); Cheng, et al., 
Sequence-Selective Peptide Binding with a Peptido-A,B-f/<?/Js-steroidal Receptor 
Selected from an Encoded Combinatorial Library, J. Am. Chem. Soc, 1 18 :1813- 
10 1814 (1996); Chu, et al., Affinity Capillary Electrophoresis to Identify the 
Peptide in A Peptide Library that Binds Most Tightly to Vancomycin, J. Org. 
Chem., 58:648-652 (1993); Clackson, et al.. Making Antibody Fragments Using 
Phage Display Libraries, Nature, 352:624-628 (1991); Combs, et aL, Protein 
Structure-Based Combinatorial Chemistry: Discovery of Non-Peptide Binding 
15 Elements to Src SH3 Domain, J. Am. Chem. Soc, 1 18 :287-288 (1996); Cwirla, 
et al., Peptides On Phage: A Vast Library of Peptides for Identifying Ligands, 
Proc. Natl. Acad. Sci. U.S.A., 87:6378-6382 (1990); Ecker, et al., Combinatorial 
Drug Discovery: Which Method will Produce the Greatest Value, 
Bio/Technology, 13:351-360 (1995); Ellington, et al., In Vitro Selection of RNA 
20 Molecules That Bind Specific Ligands, Nature, 346:81 8-822 (1990); Ellman, 
J. A., Variants of Benzodiazepines, J. Am. Chem. Soc, 1 14 :10997 (1992); 
Erickson, et al., The Proteins) Neurath, H., Hill, R.L., Eds.: Academic: New York, 
1976; pp. 255-257; Felici, et al., J. Moi. Biol., 222 :301-310 (1991); Fodor, et 
al., Light-Directed, Spatially Addressable Parallel Chemical Synthesis, Science, 
25 251:767-773 (1991); Francisco, et al., Transport and Anchoring of Beta- 
Lactamase to the External Surface of E. ColL, Proc. Natl. Acad. Sci. U.S.A., 
89:2713-2717 (1992); Georgiou, et al., Practical Applications of Engineering 
Gram-Negative Bacterial Cell Surfaces, TIBTECH, 11:6-10 (1993); Geysen, et al., 
Use of peptide synthesis to probe viral antigens for epitopes to a resolution of a 
30 single amino acid, Proc. Natl. Acad. Sci. U.S.A., 81:3998-4002 (1984); Glaser, 
et al., Antibody Engineering by Codon-Based Mutagenesis in a Filamentous 
Phage Vector System, J. Immunol., 149:3903-3913 (1992); Gram, et al., In 
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vitro selection and affinity maturation of antibodies from a naive combinatorial 
immunoglobulin library, Proc. NatL Acad. ScL, 89:3576-3580 (1992); Han, et 
al., Liquid-Phase Combinatorial Synthesis, Proc. NatL Acad. ScL U.S.A., 
92:6419-6423 (1995); Hoogenboom, et al., Multi-Subunit Proteins on the 
5 Surface of Filamentous Phage: Methodologies for Displaying Antibody (Fab) 
Heavy and Light Chains, Nucleic Acids Res., 19:4133-4137 (1991); Houghten, 
et al.. General Method for the Rapid Solid-Phase Synthesis of Large Numbers of 
Peptides: Specificity of Antigen-Antibody Interaction at the Level of Individual 
Amino Acids, Proc. NatL Acad. ScL U.S.A., 82:5131-5135 (1985); Houghten, 
10 et al., The Use of Synthetic Peptide Combinatorial Libraries for the Determination 
of Peptide Ligands in Radio-Receptor Assays-Opiod-Peptides, Bioorg. Med. 
Chem. Lett., 3:405-412 (1993); Houghten, et al., Generation and Use of 
Synthetic Peptide Combinatorial Libraries for Basic Research and Drug Discovery, 
Nature, 354:84-86 <1991); Huang, et al., Discovery of New Ligand Binding 
15 Pathways in Myoglobin by Random Mutagenesis, Nature Struct. BioL, 1:226-229 
(1994); Huse, et aL, Generation of a Large Combinatorial Library of the 
Immunoglobulin Repertoire In Phage Lambda, Science, 246:1275-1281 (1989); 
Janda, K.D., New Strategies for the Design of Catalytic Antibodies, BiotechnoL 
Prog., 6:178-181 (1990); Jung, et aL, Multiple Peptide Synthesis Methods and 
20 Their Applications, Angew. Chem. Int. Ed. Engl., 31:367-486 (1992); Kang, et 
al., Linkage of Recognition and Replication Functions By Assembling 
Combinatorial Antibody Fab Libraries Along Phage Surfaces, Proc. NatL Acad. 
ScL U.S.A.. 88:4363-4366 (1991a); Kang, et al.. Antibody Redesign by Chain 
Shuffling from Random Combinatorial Immunoglobulin Libraries, Proc. Natl. 
25 Acad. Sci. U.S.A., 88:1 1 120-1 1 123 (1991b); Kay, et al., An M13 Phage Library 
Displaying Random 38-Amino-Acid-Peptides as a Source of Novel Sequences 
with Affinity to Selected Targets Genes, Gene, 128:59-65 (1993); Lam, et al., A 
new type of synthetic peptide library for identifying ligand-binding activity, 
Nature, 354:82-84 (1991) (published errata appear in Nature, 358 :434 (1992) 
30 and Nature, 360:768 (1992); Lebl, et al., One Bead One Structure Combinatorial 
Libraries, Biopolymers (Pept. ScL), 37:1 77-1 98 (1995); Lerner, et al.. Antibodies 
without Immunization, Science, 258:1313-1314 (1992); Li, et al.. Minimization 
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of a Polypeptide Hormone, Science, 270 :1657-1660 (1995); Light, et al., 
Display of Dimeric Bacterial Alkaline Phosphatase on the Major Coat Protein of 
Filamentous Bacteriophage, Bioorg. Med. Chem. Lett., 3:1073-1079 (1992); 
Little, et al., Bacterial Surface Presentation of Proteins and Peptides; An 
5 Alternative to Phage Technology, Trends Biotechnol., 1_1 :3-5 (1993); Marks, et 
al., By-Passing Immunization. Human Antibodies from V-Gene Libraries 
Displayed on Phage, J. Mol. Bio/., 222:581-597 (1991); Matthews, et al., 
Substrate Phage: Selection of Protease Substrates by Monovalent Phage Display, 
Science, 260:1 113-1117 (1993); McCafferty, et al., Phage Enzymes: Expression 
10 and Affinity Chromatography of Functional Alkaline Phosphatase on the Surface 
of Bacteriophage, Protein Eng., 4:955-961 (1991); Menger, et al., Phosphatase 
Catalysis Developed Via Combinatorial Organic Chemistry, J. Org. Chem., 
60:6666-6667 (1995); Nicolaou, et al., Angew. Chem. int. Ed. Engt., 34:2289- 
2291 (1995); Oldenburg, et al., Peptide Ligands for A Sugar-Binding Protein 
15 Isolated from a Random Peptide Library, Proc. Natl. Acad. Sci. U.S.A., 89:5393- 
5397 (1992); Parmley, et al., Antibody-Selectable Filamentous fd Phage Vectors: 
Affinity Purification of Target Genes, Genes, 73:305-318 (1988); Pinilla, et al., 
Synthetic Peptide Combinatorial Libraries (SPCLS)-ldentification of the Antigenic 
Determinant of Beta-Endorphin Recognized by Monoclonal Antibody-3E7, Gene, 
20 128:71-76 (1 993); Pinilla, et al.. Review of the Utility of Soluble Combinatorial 
Libraries, Biopolymers, 37:221-240 (1995); Pistor, et al., Expression of Viral 
Hemagglutinan On the Surface of E. Cofi., Klin. Wochenschr., 66:1 10-1 16 
(1989); Pollack, eta)., Selective Chemical Catalysis by an Antibody, Science, 
,234:1570-1572 (1986); Rigler, et al., Fluorescence Correlations, Single Molecule 
25 Detection and Large Number Screening: Applications in Biotechnology, J. 
BiotechnoL, 41_:177-186 (1995); Sarvetnick, et a(., Increasing the Chemical 
Potential of the Germ-Line Antibody Repertoire, Proc. Natl. Acad. Sci. U.S.A., 
90:4008-401 1 (1993); Sastry, et al., Cloning of the Immunological Repertiore in 
Escherichia Co/i for Generation of Monoclonal Catalytic Antibodies: Construction 
30 of a Heavy Chain Variable Region-Specific cDNA Library, Proc. Natl. Acad. Sci. 
U.S.A., 86:5728-5732 (1989); Scott, et al.. Searching for Peptide Ligands with 
an Epitope Library, Science, 249:386-390 (1990); Sears, et al., Engineering 
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Enzymes for Bioorganic Synthesis: Peptide Bond Formation, Biotechnol. Prog., 
12:423-433 (1996); Simon, et. al., Peptides: A Modular Approach to Drug 
Discovery, Proc. NatL Acad. Sci. U.S.A., 89:9367-9371 (1992); Still, et al., 
Discovery of Sequence-Selective Peptide Binding by Synthetic Receptors Using 
5 Encoded Combinatorial Libraries, Acc. Chem. Res., 29:155-1 63 (1996); 

Thompson, et al., Synthesis and Applications of Small Molecule Libraries, Chem. 
Rev., 96:555-600 (1996); Tramontane et al.. Catalytic Antibodies, Science, 
234 :1566-1570 (1986); Wrighton, et al., Small Peptides as Potent Mimetics of 
the Protein Hormone Erythropoietin, Science, 273 :458-464 (1996); York, et al., 

10 Combinatorial mutagenesis of the reactive site region in plasminogen activator 
inhibitor I, J. Bio!. Chem., 266 :8595-8600 (1991); Zebedee, et al.. Human 
Combinatorial Antibody Libraries to Hepatitis B Surface Antigen, Proc. Natl. 
Acad. Set. U.S.A., 89.31 75-31 79 (1992); Zuckerman, et al.. Identification of 
Highest-Affinity Ligands by Affinity Selection from Equimolar Peptide Mixtures 

15 Generated by Robotic Synthesis, Proc. NatL Acad. Sci. U.S.A., 89:4505-4509 
(1992). 

For example, peptides that bind to an MTSP25 polypeptide or a protease 
domain of an SP protein can be identified using phage display libraries. In an 
exemplary embodiment, this method can include a) contacting phage from a 
20 phage library with the MTSP25 polypeptide or a protease domain thereof; (b) 
isolating phage that bind to the protein; and (c) determining the identity of at 
least one peptide coded by the isolated phage to identify a peptide that binds to 
an MTSP25 polypeptide. 

H. Modulators of the activity of MTSP25 polypeptides 

25 Provided herein are compounds, identified by screening or produced using 

the MTSP25 polypeptide or protease domain in other screening methods, that 
modulate the activity of an MTSP25. These compounds act by directly 
interacting with the MTSP25 polypeptide or by altering transcription or 
translation thereof. Such molecules include, but are not limited to, antibodies 

30 that specifically react with an MTSP25 polypeptide, particularly with the 
protease domain thereof, antisense nucleic acids or double-stranded RNA 
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(dsRNA) such as RNAi, that alter expression of the MTSP25 polypeptide, 
antibodies, peptide mimetics and other such compounds. Antisense and other 
RNA molecules can include modified bases and nucleotide analogs, and modified 
backbones, including modified sugar moieties, and modified phosphate groups, 
5 such as phosphorothioates and phosphoramidates and other such groups known 
to those of skill in the art. 
1 . Antibodies 

Antibodies, including polyclonal and monoclonal antibodies, that 
specifically bind to the MTSP25 polypeptide provided herein, particularly 
10 antibodies that bind to the single chain protease domains thereof or the activated 
forms of the full-length or protease domain, but that do not bind to the zymogen 
form, are provided. 

Generally, the antibody is a monoclonal antibody, and typically the 
antibody specifically binds to the protease domain of the MTSP25 polypeptide. 

15 In particular embodiments, antibodies to each of the activated single chain 
and/or activated two chain form of the protease domain of MTSP25 are 
provided. Also provided are antibodies that specifically bind to any domain of 
MTSP25 and to two chain forms thereof. 

The MTSP25 polypeptide and domains, fragments, homologs and 

20 derivatives thereof can be used as immunogens to generate antibodies that 

specifically bind such immunogens. Such antibodies include but are not limited 
to polyclonal, monoclonal, chimeric, single chain, Fab fragments, and an Fab 
expression library. In a specific embodiment, antibodies to human MTSP25 
polypeptide are produced. In another embodiment, complexes formed from 

25 fragments of an MTSP25 polypeptide, that contain the serine protease domain 
are used as immunogens for antibody production. 

Various procedures known in the art can be used for the production of 
polyclonal antibodies to MTSP25 polypeptide, its domains, derivatives, 
fragments or analogs. For production of the antibody, various host animals can 

30 be immunized by injection with the native MTSP25 polypeptide or a synthetic 
version, or a derivative of the foregoing, such as a cross-linked MTSP25 
polypeptide. Such host animals include but are not limited to rabbits, mice, rats, 
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etc. Various adjuvants can be used to increase the immunological response, 
depending on the host species, and include but are not limited to Freund's 
(complete and incomplete), mineral gels such as aluminum hydroxide, surface 
active substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil 
5 emulsions, dinitrophenol, and potentially useful human adjuvants such as bacille 
Calmette-Guerin (BCG) and corynebacterium parvum. 

For preparation of monoclonal antibodies directed towards an MTSP25 
polypeptide or domains, derivatives, fragments or analogs thereof, any technique 
that provides for the production of antibody molecules by continuous cell lines in 
10 culture can be used. Such techniques include but are not restricted to the 
hybridoma technique originally developed by Kohler and Milstein (Nature 
256:495-497 (1975)), the trioma technique, the human B-cell hybridoma 
technique (Kozbor et al., Immunology Today 4:72 (1983)), and the EBV 
hybridoma technique to produce human monoclonal antibodies (Cole et ak, in 
15 Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96 

(1985)). In an additional embodiment, monoclonal antibodies can be produced in 
germ-free animals utilizing recent technology (PCT/US90/02545). Human 
antibodies can be used and can be obtained by using human hybridomas (Cote 
et al., Proc. Natl. Acad. Set. USA 80:2026-2030 (1983)), or by transforming 
20 human B cells with EBV virus in vitro (Cole et al., in Monoclonal Antibodies and 
Cancer Therapy, Alan R. Liss, Inc., pp. 77-96 (1985)). Techniques developed 
for the production of ^chimeric antibodies* (Morrison et al., Proc. Natl. Acad. 
Sci. USA 81 :685 1-6855 (1984); Neuberger et ah, Nature 312 :604-608 (1984); 
Takeda et al., Nature 314:452-454 (1985)) by splicing the genes from a mouse 
25 antibody molecule specific for the MTSP25 polypeptide together with genes from 
a human antibody molecule of appropriate biological activity can be used. 

MTSP25-encoding nucleic acid molecules or portions thereof can be used 
in DNA immunization protocols to produce antibodies that bind to MTSP25 (see, 
e.g., U.S. Patent No. 5,795,872 and U.S. Patent No. 5,643,578 and U.S. Patent 
30 No. 6,337,072). 

Techniques described for the production of single chain antibodies (U.S. 
patent 4,946,778) can be adapted to produce MTSP25 poiypeptide-specific 
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single chain antibodies. An additional embodiment uses the techniques 
described for the construction of Fab expression libraries (Huse et a!., Science 
246:1275-1281 (1989)) to allow rapid and easy identification of monoclonal Fab 
fragments with the desired specificity for MTSP25 polypeptide or domains, 
5 derivatives, or analogs thereof. Non-human antibodies can be "humanized" by 
known methods {see, e.g., U.S. Patent No. 5,225,539). 

Antibody fragments that specifically bind to MTSP25 polypeptide or 
epitopes thereof can be generated by techniques known in the art. For example, 
such fragments include but are not limited to: the F(ab')2 fragment, which can 
10 be produced by pepsin digestion of the antibody molecule; the Fab' fragments 
that can be generated by reducing the disulfide bridges of the F(ab')2 fragment; 
the Fab fragments that can be generated by treating the antibody molecule with 
papain and a reducing agent; and Fv fragments. 

In the production of antibodies, screening for the desired antibody can be 
15 accomplished by techniques known in the art, e.g., ELISA (enzyme-linked 

immunosorbent assay). To select antibodies specific for a particular domain of 
the MTSP25 polypeptide one can assay generated hybridomas for a product that 
binds to the fragment of the MTSP25 polypeptide that contains such a domain. 
The foregoing antibodies can be used in methods known in the art 
20 relating to the localization and/or quantitation of MTSP25 polypeptide proteins, 
e.g., for imaging these proteins, measuring levels thereof in appropriate 
physiological samples, in, for example, diagnostic methods. In another 
embodiment, anti-MTSP25 polypeptide antibodies, or fragments thereof, 
containing the binding domain are used as therapeutic agents. 
25 2. Peptides, Polypeptides and Peptide Mimetics 

Provided herein are methods for identifying molecules that bind to and 
modulate the activity of SP proteins. Included among molecules that bind to 
SPs, particularly the single chain protease domain or catalytically active 
fragments thereof, are peptides, polypeptides and peptide mimetics, including 
30 cyclic peptides. Peptide mimetics are molecules or compounds that mimic the 
necessary molecular conformation of a ligand or polypeptide for specific binding 
to a target molecule such as an MTSP25 polypeptide. In an exemplary 
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embodiment, the peptides, polypeptides and peptide mimetics bind to the 
protease domain of the MTSP25 po.ypeptide. Such peptides and peptide 
mimetics include those of antibodies that specifically bind to an MTSP25 
polypeptide and, typically, bind to the protease domain of an MTSP25 
5 polypeptide. The peptides, polypeptides and peptide mimetics identified by 
methods provided herein can be agonists or antagonists of MTSP25 
polypeptides. 

Such peptides, polypeptides and peptide mimetics are useful for 
diagnosing, treating, preventing, and screening for a disease or disorder 
10 associated with MTSP25 polypeptide activity in a mammal. In addition, the 

peptides and peptide mimetics are useful for identifying, isolating, and purifying 
molecules or compounds that modulate the activity of an MTSP25 polypeptide, 
or specifically bind to an MTSP25 polypeptide, generally the protease domain of 
an MTSP25 polypeptide. Low molecular weight peptides and peptide mimetics 
15 can have strong binding properties to a target molecule, e.g., an MTSP25 
polypeptide or the protease domain of an MTSP25 polypeptide. 

Peptides, polypeptides and peptide mimetics that bind to MTSP25 
polypeptides as described herein can be administered to mammals, including 
humans, to modulate MTSP25 polypeptide activity. Thus, methods for 
20 therapeutic treatment and prevention of neoplastic diseases comprise 

administering a peptide, polypeptide or peptide mimetic compound in an amount 
sufficient to modulate such activity are provided. Thus, also provided herein are 
methods for treating a subject having such a disease or disorder in which a 
peptide, polypeptide or peptide mimetic compound is administered to the subject 
25 in a therapeutically effective dose or amount. 

Compositions containing the peptides, polypeptides or peptide mimetics 
can be administered for prophylactic and/or therapeutic treatments. In 
therapeutic applications, compositions can be administered to a patient already 
suffering from a disease, as described above, in an amount sufficient to cure or 
at least partially arrest the symptoms of the disease and its complications. 
Amounts effective for this use will depend on the severity of the disease and the 
weight and general state of the patient and can be empirically determined. 



30 
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In prophylactic applications, compositions containing the peptides, 
polypeptides and peptide mimetics are administered to a patient susceptible to or 
otherwise at risk of a particular disease. Such an amount is defined to be a 
"prophylactically effective dose." In this use, the precise amounts again depend 
5 on the patient's state of health and weight. Accordingly, the peptides, 

polypeptides and peptide mimetics that bind to an MTSP25 polypeptide can be 
used to prepare pharmaceutical compositions containing, as an active ingredient, 
at least one of the peptides, polypeptides or peptide mimetics in association with 
a pharmaceutical carrier or diluent. The compounds can be administered, for 
10 example, by oral, pulmonary, parenteral (intramuscular, intraperitoneal, 
intravenous (IV) or subcutaneous injection), inhalation (via a fine powder 
formulation), transdermal, nasal, vaginal, rectal, or sublingual routes of 
administration and can be formulated in dosage forms appropriate for each route 
of administration (see, e.g., International PCT application Nos. WO 93/25221 
15 and WO 94/17784; and European Patent Application 613,683). 

Peptides, polypeptides and peptide mimetics that bind to MTSP25 
polypeptides are useful in vitro as unique tools for understanding the biological 
role of MTSP25 polypeptides, including the evaluation of the many factors 
thought to influence, and be influenced by, the production of MTSP25 
20 polypeptide. Such peptides, polypeptides and peptide mimetics are also useful in 
the development of other compounds that bind to and modulate the activity of 
an MTSP25 polypeptide, because such compounds provide important 
information on the relationship between structure and activity that should 
facilitate such development. 
25 The peptides, polypeptides and peptide mimetics are also useful as 

competitive binders in assays to screen for new MTSP25 polypeptides or 
MTSP25 polypeptide agonists. In such assay embodiments, the compounds can 
be used without modification or can be modified in a variety of ways; for 
example, by labeling, such as covalently or non-covalently joining a moiety 
30 which directly or indirectly provides a detectable signal. In any of these assays, 
the materials thereto can be labeled either directly or indirectly. Possibilities for 
direct labeling include label groups such as: radiolabels such as 125 l enzymes 
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10 



(U.S. Pat. No. 3,645,090) such as peroxidase and alkaline phosphatase, and 
fluorescent labels (U.S. Pat. No. 3,940,475) capable of monitoring the change in 
fluorescence intensity, wavelength shift, or fluorescence polarization. 
Possibilities for indirect labeling include biotinylation of one constituent followed 
by binding to avidin coupled to one of the above label groups. The compounds 
can also include spacers or linkers in cases where the compounds are to be 
attached to a solid support. 

Moreover, based on their ability to bind to an MTSP25 polypeptide, the 
peptides, polypeptides and peptide mimetics can be used as reagents for 
detecting MTSP25 polypeptides in living cells, fixed cells, in biological fluids, in 
tissue homogenates and in purified, natural biological materials. For example, 
by labelling such peptides, polypeptides and peptide mimetics, cells having 
MTSP25 polypeptides can be identified. In addition, based on their ability to 
bind an MTSP25 polypeptide, the peptides, polypeptides and peptide mimetics 
15 can be used in in situ staining, FACS (fluorescence-activated cell sorting), 

Western blotting, ELISA and other analytical protocols. Based on their ability to 
bind to an MTSP25 polypeptide, the peptides, polypeptides and peptide mimetics 
can be used in purification of MTSP25 polypeptides or in purifying cells 
expressing the MTSP25 polypeptides, e.g., a polypeptide encoding the protease 
20 domain of an MTSP25 polypeptide. 

The peptides, polypeptides and peptide mimetics can also be used as 
commercial reagents for various medical research and diagnostic uses. The 
activity of the peptides and peptide mimetics can be evaluated either in vitro or 
in vivo in one of the numerous models described in McDonald (1992) Am. J. of 
25 Pediatric Hematofogy/Oncology, 14.8-2']. 

3. Peptide, polypeptide and peptide mimetic therapy 
Peptide analogs are commonly used in the pharmaceutical industry as 
non-peptide drugs with properties analogous to those of the template peptide. 
These types of non-peptide compounds are termed "peptide mimetics" or 
30 "peptidomimetics" (Luthman etaL, A Textbook of Drug Design and 

Development 14:386-406, 2nd Ed., Harwood Academic Publishers (1996); 
Joachim Grante (1994) Angew. Chem. int. Ed. EngL, 33:1699-1720; Fauchere 
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(1986) J. Adv. Drug Res., 75:29; Veber and Freidinger (1985) TINS, p. 392; and 
Evans et al. (1987) J. Med. Chem. 50:1229). Peptide mimetics that are 
structurally similar to therapeutically useful peptides can be used to produce an 
equivalent or enhanced therapeutic or prophylactic effect. Preparation of 
5 peptidomimetics and structures thereof are known to those of skill in this art. 
Systematic substitution of one or more amino acids of a consensus 
sequence with a D-amino acid of the same type (e.g., D-lysine in place of 
L-lysine) can be used to generate more stable peptides. In addition, constrained 
peptides containing a consensus sequence or a substantially identical consensus 
1 0 sequence variation can be generated by methods known in the art (Rizo et al. 
(1992) An. Rev. Biochem., 67:387, incorporated herein by reference); for 
example, by adding internal cysteine residues capable of forming intramolecular 
disulfide bridges which cyclize the peptide. 

Those skilled in the art appreciate that modifications can be made to the 
15 peptides and mimetics without deleteriously effecting the biological or functional 
activity of the peptide. Further, the skilled artisan would know how to design 
non-peptide structures in three dimensional terms, that mimic the peptides that 
bind to a target molecule, e.g., an MTSP25 polypeptide or, generally, the 
protease domain of MTSP25 polypeptides (see, e.g., Eck and Sprang (1 989) J. 
20 Biol. Chem., 26; 17605-18795). 

When used for diagnostic purposes, the peptides and peptide mimetics 
can be labeled with a detectable label and, accordingly, the peptides and peptide 
mimetics without such a label can serve as intermediates in the preparation of 
labeled peptides and peptide mimetics. Detectable labels can be molecules or 
25 compounds, which when covalently attached to the peptides and peptide 
mimetics, permit detection of the peptide and peptide mimetics in vivo, for 
example, in a patient to whom the peptide or peptide mimetic has been 
administered, or in vitro, e.g., in a sample or cells. Suitable detectable labels are 
well known in the art and include, by way of example, radioisotopes, fluorescent 
30 labels (e.g., fluorescein), and the like. The particular detectable label employed 
is not critical and is selected to be detectable at non-toxic levels. Selection of 
the such labels is well within the skill of the art. 
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Covalent attachment of a detectable label to the peptide or peptide 
mimetic is accomplished by conventional methods well known in the art. For 
example, when the 125 l radioisotope is employed as the detectable label, covalent 
attachment of 125 l to the peptide or the peptide mimetic can be achieved by 
5 incorporating the amino acid tyrosine into the peptide or peptide mimetic and 
then iodinating the peptide (see, e.g., Weaner eta/. (1994) Synthesis and 
Applications of fsotopically Labelled Compounds, pp. 137-140). If tyrosine is 
not present in the peptide or peptide mimetic, incorporation of tyrosine to the N 
or C terminus of the peptide or peptide mimetic can be achieved by well known 
10 chemistry. Likewise, 32 P can be incorporated onto the peptide or peptide 

mimetic as a phosphate moiety through, for example, a hydroxyl group on the 
peptide or peptide mimetic using conventional chemistry. 

Labeling of peptidomimetics usually involves covalent attachment of one 
or more labels, directly or through a spacer {e.g., an amide group), to 
15 non-interfering position(s) on the peptidomimetic that are predicted by 
quantitative structure-activity data and/or molecular modeling. Such 
non-interfering positions generally are positions that do not form direct contacts 
with the macromolecules(s) to which the peptidomimetic binds to produce the 
therapeutic effect. Derivatization {e.g., labeling) of peptidomimetics should not 
20 substantially interfere with the desired biological or pharmacological activity of 
the peptidomimetic. 

Peptides, polypeptides and peptide mimetics that can bind to an MTSP25 
polypeptide or the protease domain of MTSP25 polypeptides and/or modulate 
the activity thereof, or exhibit MTSP25 polypeptide activity, can be used for 

25 treatment of neoplastic disease. The peptides, polypeptides and peptide 

mimetics can be delivered, in vivo or ex vivo, to the cells of a subject in need of 
treatment. Further, peptides which have MTSP25 polypeptide activity can be 
delivered, in vivo or ex vivo, to cells which carry mutant or missing alleles 
encoding the MTSP25 polypeptide gene. Any of the techniques described herein 

30 or known to the skilled artisan can be used for preparation and in vivo or ex vivo 
delivery of such peptides, polypeptides and peptide mimetics that are 
substantially free of other human proteins. For example, the peptides, 
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polypeptides and peptide mimetics can be readily prepared by expression in a 
microorganism or synthesis in vitro. 

The peptides or peptide mimetics can be introduced into cells, in vivo or 
ex vivo, by microinjection or by use of liposomes, for example. Alternatively, the 
5 peptides, polypeptides or peptide mimetics can be taken up by cells, in vivo or 
ex vivo, actively or by diffusion. In addition, extracellular application of the 
peptide, polypeptide or peptide mimetic can be sufficient to effect treatment of a 
neoplastic disease. Other molecules, such as drugs or organic compounds, that: 
U.bind to a MTSP25 polypeptide or protease domain thereof; or 2) have a 
10 similar function or activity to an MTSP25 polypeptide or protease domain 
thereof, can be used in methods for treatment. 

4. Rational drug design 

The goal of rational drug design is to produce structural analogs of 
biologically active polypeptides or peptides of interest or of small molecules or 

15 peptide mimetics with which they interact [e.g., agonists and antagonists) in 
order to fashion drugs which are, e.g., more active or stable forms thereof; or 
which, for example, enhance or interfere with the function of a polypeptide in 
vivo (e.g., an MTSP25 polypeptide). In one approach, one first determines the 
three-dimensional structure of a protein of interest {e.g., an MTSP25 polypeptide 

20 or polypeptide having a protease domain) or, for example, of an MTSP25 

polypeptide-ligand complex, by X-ray crystallography, by computer modeling or 
most typically, by a combination of approaches (see, e.g., Erickson et aL 1990). 
Also, useful information regarding the structure of a polypeptide can be gained 
by modeling based on the structure of homologous proteins. In addition, 

25 peptides can be analyzed by an alanine scan. In this technique, an amino acid 
residue is replaced by Ala, and its effect on the peptide's activity is determined. 
Each of the amino acid residues of the peptide is analyzed in this manner to 
determine the important regions of the peptide. 

Also, a polypeptide or peptide that binds to an MTSP25 polypeptide or, 

30 generally, the protease domain of an MTSP25 polypeptide, can be selected by a 
functional assay, and then the crystal structure of this polypeptide or peptide 
alone and in complex with MT5P25 can be determined. The polypeptide can be, 
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10 



15 



for example, an antibody specific for an MTSP25 polypeptide or the protein 
domain of an MTSP25 polypeptide. This approach can yield a pharmacophore 
upon which subsequent drug design can be based. Further, it is possible to 
bypass the crystallography altogether by generating anti-idiotypic polypeptides or 
peptides, (anti-ids) to a functional, pharmacologically active polypeptide or 
peptide that binds to an MTSP25 polypeptide or protease domain of an MTSP25 
polypeptide. As a mirror image of a mirror image, the binding site of the anti-ids 
is expected to be an analog of the original target molecule, e.g., an MTSP25 
polypeptide or polypeptide having an MTSP25 polypeptide. The anti-id could 
then be used to identify and isolate peptides from banks of chemically or 
biologically produced banks of peptides. Selected peptides would then act as 
the pharmacophore. 

Thus, one can design drugs which have, for example, improved activity or 
stability or which act as modulators (e.g., inhibitors, agonists or antagonists) of 
MTSP25 polypeptide activity, and are useful in the methods, particularly the 
methods for diagnosis, treatment, prevention, and screening of a neoplastic 
disease. By virtue of the availability of nucleic acid that encodes MTSP25 
polypeptides, sufficient amounts of the MTSP25 polypeptide can be made 
available to perform such analytical studies as X-ray crystallography. In addition, 
20 the knowledge of the amino acid sequence of an MTSP25 polypeptide or the 
protease domain thereof, e.g., the protease domain encoded by the amino acid 
sequence of SEQ ID Nos. 16 and 6, can provide guidance on computer modeling 
techniques in place of, or in addition to. X-ray crystallography. 

Methods of identifying peptides and peptide mimetics that bind to 
25 MTSP25 polypeptides 

Peptides having a binding affinity to the MTSP25 polypeptide provided 
herein (e.g., an MTSP25 polypeptide or a polypeptide having a protease domain 
of an MTSP25 polypeptide) can be readily identified, for example, by random 
peptide diversity generating systems coupled with an affinity enrichment 
30 process. Specifically, random peptide diversity generating systems include the 
"peptides on plasmids" system (see, e.g., U.S. Patent Nos. 5,270,170 and 
5,338,665); the "peptides on phage" system (see, e.g., U.S. Patent No. 
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6,121,238 and Cwiria,ef at. (1990) Proc. Natl. Acad. ScL U.S.A. 
57:6378-6382); the "polysome system;" the "encoded synthetic library (ESL)" 
system; and the "very large scale immobilized polymer synthesis" system (see, 
e.g., U.S. Patent No. 6,121,238; and Dower et at. (1991) An. Rep. Med. Chem. 
5 25:271-280 

For example, using the procedures described above, random peptides can 
generally be designed to have a defined number of amino acid residues in length 
{e.g., 12). To generate the collection of oligonucleotides encoding the random 
peptides, the codon motif (NNK)x, where N is nucleotide A, C, G, or T 

10 (equimolar; depending on the methodology employed, other nucleotides can be 
employed), K is G or T (equimolar), and x is an integer corresponding to the 
number of amino acids in the peptide (e.g., 12) can be used to specify any one 
of the 32 possible codons resulting from the NNK motif: 1 for each of 12 amino 
acids, 2 for each of 5 amino acids, 3 for each of 3 amino acids, and only one of 

1 5 the three stop codons. Thus, the NNK motif encodes all of the amino acids, 
encodes only one stop codon, and reduces codon bias. 

The random peptides can be presented, for example, either on the surface 
of a phage particle, as part of a fusion protein containing either the pill, pVI, 
pVII, pVIII or the pIX coat protein of a phage fd derivative (peptides on phage) or 

20 as a fusion protein with the Lacl peptide fusion protein bound to a plasmid 

(peptides on plasmids). The phage or plasmids, including the DNA encoding the 
peptides, can be identified and isolated by an affinity enrichment process using 
immobilized MTSP25 polypeptide having a protease domain. The affinity 
enrichment process, sometimes called "panning," typically involves multiple 

25 rounds of incubating the phage, plasmids, or polysomes with the immobilized 
MTSP25 polypeptide, collecting the phage, plasmids, or polysomes that bind to 
the MTSP25 polypeptide (along with the accompanying DNA or mRNA), and 
producing more of the phage or plasmids (along with the accompanying 
Lacl-peptide fusion protein) collected. 
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Characteristics of peptides and peptide mimetics 
Among the peptides, polypeptides and peptide mimetics for therapeutic 
application are those of having molecular weights from about 250 to about 
8,000 daltons. If such peptides are oligomerized, dimerized and/or derivatized 
5 with a hydrophilic polymer (e.g., to increase the affinity and/or activity of the 
compounds), the molecular weights of such peptides can be substantially greater 
and can range anywhere from about 500 to about 1 20,000 daltons, generally 
from about 8,000 to about 80,000 daltons. Such peptides can contain 9 or 
more amino acids that are naturally occurring or synthetic (non-naturally 

10 occurring) amino acids. One skilled in the art can determine the affinity and 
molecular weight of the peptides and peptide mimetics suitable for therapeutic 
and/or diagnostic purposes (e.g., see Dower et at., U.S. Patent No. 6,121,238). 

The peptides can be covalently attached to one or more of a variety of 
hydrophilic polymers. Suitable hydrophilic polymers include, but are not limited 

1 5 to, polyalkylethers as exemplified by polyethylene glycol and polypropylene 
glycol, polylactic acid, polyglycolic acid, polyoxyalkenes, polyvinylalcohol, 
polyvinylpyrrolidone, cellulose and cellulose derivatives, dextran and dextran 
derivatives. When the peptide compounds are derivatized with such polymers, 
their solubility and circulation half-lives can be increased with little, if any, 

20 diminishment in their binding activity. The peptide compounds can be dimerized 
and each of the dimeric subunits can be covalently attached to a hydrophilic 
polymer. The peptide compounds can be PEGylated, i.e., covalently attached to 
polyethylene glycol (PEG). 

5. Methods of preparing peptides and peptide mimetics 

25 Peptides that bind to MTSP25 polypeptides can be prepared by classical 

methods known in the art, for example, by using standard solid phase 
techniques. The standard methods include exclusive solid phase synthesis, 
partial solid phase synthesis methods, fragment condensation, classical solution 
synthesis, and even by recombinant DNA technology (see, e.g., Merrifield 

30 (1963) J. Am. Chem. Soc, 85'^A9, incorporated herein by reference.) 

Using the "encoded synthetic library" or "very large scale immobilized 
polymer synthesis" systems (see, e.g., U.S. Patent No. 5,925,525, and 
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5,902,723), the minimum size of a peptide with the activity of interest can be 
determined. In addition all peptides that form the group of peptides that differ 
from the desired motif (or the minimum size of that motif) in one, two, or more 
residues can be prepared. This collection of peptides then can be screened for 
5 the ability to bind to the target molecule, e.g., MTSP25 polypeptide or, 

generally, the protease domain of an MTSP25 polypeptide. This immobilized 
polymer synthesis system or other peptide synthesis methods can also be used 
to synthesize truncation analogs and deletion analogs and combinations of 
truncation and deletion analogs of the peptide compounds. 

10 These procedures can also be used to synthesize peptides in which amino 

acids other than the 20 naturally occurring, genetically encoded amino acids are 
substituted at one, two, or more positions of the peptide. For instance, 
naphthylalanine can be substituted for tryptophan, facilitating synthesis. Other 
synthetic amino acids that can be substituted into the peptides include 

15 L-hydroxypropyl, L-3, 4-dihydroxy-phenylalanyl, d amino acids such as 

L-d-hydroxylysyl and D-d-methylalanyl, L-a-methylalanyl, J5 amino acids, and 
isoquinolyl. D amino acids and non-naturally occurring synthetic amino acids 
can also be incorporated into the peptides (see, e.g., Roberts et al. (1983) 
Unusual Amino/ Acids in Peptide Synthesis, 5(6):341 -449). 

20 The peptides also can be modified by phosphorylation (see, e.g., W. 

Bannwarth et aL (1996) Biorganic and Medicinal Chemistry Letters, 
ff(17):2141-2146), and other methods for making peptide derivatives (see, e.g., 
Hruby et al. (1990) Biochem. J., 2£S(2):249-262). Thus, peptide compounds 
also serve as a basis to prepare peptide mimetics with similar or improved 

25 biological activity. 

Those of skill in the art recognize that a variety of techniques are 
available for constructing peptide mimetics with the same or similar desired 
biological activity as the corresponding peptide compound but with more 
favorable activity than the peptide with respect to solubility, stability, and 

30 susceptibility to hydrolysis and proteolysis (see, e.g., Morgan et a!. (1 989) An. 
Rep. Med. Chem., 24:243-252). Methods for preparing peptide mimetics 
modified at the N-terminal amino group, the C-terminal carboxyl group, and/or 
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changing one or more of the amido linkages in the peptide to a non-amido 

linkage are known to those of skill in the art. 

> 

Amino terminus modifications include, but are not limited to, alkylating, 
acetylating and adding a carbobenzoyl group, forming a succinimide group {see, 
5 e.g., Murray et at. (1995) Burger's Medicinal Chemistry and Drug Discovery, 5th 
ed., VoL 7, Manfred E. Wolf, ed., John Wiley and Sons, Inc.). C-terminal 
modifications include mimetics wherein the C-terminal carboxyl group is replaced 
by an ester, an amide or modifications to form a cyclic peptide. 

In addition to N-terminal and C-terminal modifications, the peptide 
1 0 compounds, including peptide mimetics, can advantageously be modified with or 
covalently coupled to one or more of a variety of hydrophilic polymers. It has 
been found that when peptide compounds are derivatized with a hydrophilic 
polymer, their solubility and circulation half-lives can be increased and their 
immunogenicity is masked, with little, if any, diminishment in their binding 
1 5 activity. Suitable nonproteinaceous polymers include, but are not limited to, 

polyalkylethers as exemplified by polyethylene glycol and polypropylene glycol, 
polylactic acid, polygiycolic acid, polyoxyalkenes, poiyvinylalcohol, 
polyvinylpyrrolidone, cellulose and cellulose derivatives, dextran and dextran 
derivatives. Generally, such hydrophilic polymers have an average molecular 
20 weight ranging from about 500 to about 100,000 daltons, including from about 
2,000 to about 40,000 daltons and, from about 5,000 to about 20,000 daltons. 
The hydrophilic polymers also can have an average molecular weights of about 
5,000 daltons, 10,000 daltons and 20,000 daltons. 

Methods for derivatizing peptide compounds or for coupling peptides to 
25 such polymers have been described (see, e.g., Zallipsky (1995) Bioconjugate 

Chem., 6^:150-165; Monfardini et at. (199£) Bioconjugate Chem., 5:62-69; U.S. 
Pat. No. 4,640,835; U.S. Pat. No. 4,496,689; U.S. Pat. No. 4,301,144; U.S. 
Pat. No. 4,670,417; U.S. Pat. No. 4,791,192; U.S. Pat. No. 4,179,337 and WO 
95/34326, all of which are incorporated by reference in their entirety herein). 

Other methods for making peptide derivatives are described, for example, 
in Hruby et at. (1990), Biochem J., 26>S(2):249-262, which is incorporated 
herein by reference. Thus, the peptide compounds also serve as structural 
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models for non-peptidic compounds with similar biological activity. Those of 
skill in the art recognize that a variety of techniques are available for 
constructing compounds with the same or similar desired biological activity as a 
particular peptide compound but with more favorable activity with respect to 
5 solubility, stability, and susceptibility to hydrolysis and proteolysis (see, e.g., 
Morgan eta/. (1989) An. Rep. Med. Chem., 24:243-252, incorporated herein by 
reference). These techniques include replacing the peptide backbone with a 
backbone composed of phosphonates, amidates, carbamates, sulfonamides, 
secondary amines, and N-methylamino acids. 
10 Peptide compounds can exist in a cyclized form with an intramolecular 

disulfide bond between the thiol groups of the cysteines. Alternatively, an 
intermolecular disulfide bond between the thiol groups of the cysteines can be 
produced to yield a dimeric (or higher oligomeric) compound. One or more of the 
cysteine residues can also be substituted with a homocysteine. 
15 I. Conjugates 

A conjugate, containing: a) a single chain protease domain (or 
proteolytically active portion thereof) of an MTSP25 polypeptide or a full length 
zymogen, activated form thereof, or two or single chain protease domain 
thereof; and b) a targeting agent linked to the MTSP25 polypeptide directly or 
20 via a linker, wherein the agent facilitates: i) affinity isolation or purification of the 
conjugate; ii) attachment of the conjugate to a surface; iii) detection of the 
conjugate; or iv) targeted delivery to a selected tissue or cell, is provided herein. 
The conjugate can be a chemical conjugate or a fusion protein mixture thereof. 
The targeting agent can be a protein or peptide fragment, such as a 
25 tissue specific or tumor specific monoclonal antibody or growth factor or 

fragment thereof linked either directly or via a linker to an MTSP25 polypeptide 
or a protease domain thereof. The targeting agent can also be a protein or 
peptide fragment that contains a protein binding sequence, a nucleic acid binding 
sequence, a lipid binding sequence, a polysaccharide binding sequence, or a 
30 metal binding sequence, or a linker for attachment to a solid support. In a 

particular embodiment, the conjugate contains a) the MTSP25 or portion thereof, 
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as described herein; and b) a targeting agent linked to the MTSP25 polypeptide 
directly or via a linker. 

Conjugates, such as fusion proteins and chemical conjugates, of the 
MTSP25 polypeptide with a protein or peptide fragment (or plurality thereof) that 
5 functions, for example, to facilitate affinity isolation or purification of the 

MTSP25 polypeptide domain, attachment of the MTSP25 polypeptide domain to 
a surface, or detection of the MTSP25 polypeptide domain are provided. The 
conjugates can be produced by chemical conjugation, such as via thiol linkages, 
and can be produced by recombinant means as fusion proteins. In the fusion 

10 protein, the peptide or fragment thereof is linked to either the N-terminus or C- 
terminus of the MTSP25 polypeptide domain. In chemical conjugates the 
peptide or fragment thereof can be linked anywhere that conjugation can be 
effected, and there can be a plurality of such peptides or fragments linked to a 
single MTSP25 polypeptide domain or to a plurality thereof . 

1 5 The targeting agent is for in vitro or in vivo delivery to a cell or tissue, 

and includes agents such as cell or tissue-specific antibodies, growth factors and 
other factors (including compounds) that bind to moieties expressed on specific 
cells; and other cell or tissue specific agents that promote directed delivery of a 
linked protein. The targeting agent can be one that specifically delivers the 

20 MTSP25 polypeptide to selected cells by interaction with a cell surface protein 
and internalization of conjugate or MTSP25 polypeptide portion thereof. 

These conjugates are used in a variety of methods and are particularly 
suited for use in methods of activation of prodrugs, such as prodrugs that upon 
cleavage by the particular MTSP25, which is localized at or near the targeted cell 

25 or tissue, are cytotoxic. The prodrugs are administered prior to, or 

simultaneously with, or subsequently to the conjugate. Upon delivery to the 
targeted cells, the protease activates the prodrug, which then exhibits a 
therapeutic effect, such as a cytotoxic effect. 
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1 . Conjugation 

Conjugates with linked MTSP25 polypeptide domains can be prepared 
either by chemical conjugation, recombinant DNA technology, or combinations of 
recombinant expression and chemical conjugation. The MTSP25 polypeptide 
5 domains and the targeting agent can be linked in any orientation and more than 
one targeting agent and/or MTSP25 polypeptide domain can be present in a 
conjugate. 

a. Fusion proteins 

Fusion proteins are provided herein. A fusion protein contains: a) one or 
10 a plurality of domains of an MTSP25 polypeptide and b) a targeting agent. The 
fusion proteins are generally produced by recombinant expression of nucleic 
acids that encode the fusion protein. 

b. Chemical conjugation 

To effect chemical conjugation herein, the MTSP25 polypeptide domain is 
1 5 linked via one or more selected linkers or directly to the targeting agent. 

Chemical conjugation must be used- if the targeted agent is other than a peptide 
or protein, such as a nucleic acid or a non-peptide drug. Any means known to 
those of skill in the art for chemically conjugating selected moieties can be used. 
2. Linkers 

20 Linkers for two purposes are contemplated herein. The conjugates can 

include one or more linkers between the MTSP25 polypeptide portion and the 
targeting agent. Additionally, linkers are used for facilitating or enhancing 
immobilization of an MTSP25 polypeptide or portion thereof on a solid support, 
such as a microtiter plate, silicon or silicon-coated chip, glass or plastic support, 
25 such as for high throughput solid phase screening protocols. 

Any linker known to those of skill in the art for preparation of conjugates 
can be used herein. These linkers are typically used in the preparation of 
chemical conjugates; peptide linkers can be incorporated into fusion proteins. 
Linkers can be any moiety suitable to associate a domain of MTSP25 
30 polypeptide and a targeting agent. Such linkers and linkages include, but are not 
limited to, peptidic linkages, amino acid and peptide linkages, typically containing 
between one and about 60 amino acids, more generally between about 10 and 
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30 amino acids, chemical linkers, such as heterobifunctional cleavable cross- 
linkers, including but are not limited to, N-succinimidyl (4-iodoacetyl)- 
aminobenzoate, sulfosuccinimidyl (4-iodoacetyl)-aminobenzoate, 4-succinimidyl- 
oxycarbonyl-a-(2-pyridyldithio)toluene, sulfosuccinimidyl-6-[a-methyl-a- 
5 (pyridyldithiol)-toluamido] hexanoate, N-succinimidyl-3-(-2-pyridyldithio) - 
propionate, succinimidyl 6[3(-{-2-pyridyldithio)-propionamido] hexanoate, 
sulfosuccinimidyl 6[3(-(-2-pyridyldithio)-propionamido] hexanoate, 3-(2-pyridyldi- 
thio)-propionyl hydrazide, Ellman's reagent, dichlorotriazinic acid, and S-{2- 
thiopyridyl)-L-cysteine. Other linkers include, but are not limited to peptides- and 
10 other moieties that reduce steric hindrance between the domain of MTSP25 

polypeptide and the targeting agent, intracellular enzyme substrates, linkers that 
increase the flexibility of the conjugate, linkers that increase the solubility of the 
conjugate, linkers that increase the serum stability of the conjugate, 
photocleavable linkers and acid cleavable linkers. 
15 Other exemplary linkers and linkages that are suitable for chemically 

linked conjugates include, but are not limited to, disulfide bonds, thioether 
bonds, hindered disulfide bonds, and covalent bonds between free reactive 
groups, such as amine and thiol groups. These bonds are produced using 
heterobifunctional reagents to produce reactive thiol groups on one or both of 
20 the polypeptides, and then reacting the thiol groups on one polypeptide with 

reactive thiol groups or amine groups to which reactive maleimido groups or thiol 
groups can be attached on the other. Other linkers include, acid cleavable 
linkers, such as bismaleimideothoxy propane, acid labile-transferrin conjugates 
and adipic acid diihydrazide, that would be cleaved in more acidic intracellular 
25 compartments; cross linkers that are cleaved upon exposure to UV or visible 

light; and linkers, such as various domains, such as C H 1, C H 2, and C H 3, from the 
constant region of human lgG 1 (see, Batra et al. Molecular Immunol,, 30:379- 
386 (1993)). In some embodiments, several linkers can be included in order to 
take advantage of desired properties of each linker. 
30 Chemical linkers and peptide linkers can be inserted by covalently 

coupling the linker to the domain of MTSP25 polypeptide and the targeting 
agent. The heterobifunctional agents, described below, can be used to effect 
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such covalent coupling. Peptide linkers can also be linked by expressing DNA 
encoding the linker and therapeutic agent (TA), linker and targeted agent, or 
linker, targeted agent and therapeutic agent (TA) as a fusion protein. Flexible 
linkers and linkers that increase solubility of the conjugates are contemplated for 
5 use, either alone or with other linkers are also contemplated herein. 

a) Acid cleavable, photocleavable and heat sensitive linkers 
Acid cleavable linkers, photocleavable and heat sensitive linkers can also 
be used, particularly where it can be necessary to cleave the domain of MTSP25 
polypeptide to permit it to be more readily accessible to reaction. Acid cleavable 
10 linkers include, but are not limited to, bismaleimideothoxy propane; and adipic 
acid dihydrazide linkers (see, e.g., Fattom et al. (1992) Infection & Immun. 
50:584-589) and acid labile transferrin conjugates that contain a sufficient 
portion of transferrin to permit entry into the intracellular transferrin cycling 
pathway (see, e.g., Welhoner et aL (1991) J. Biol. Chem. 265:4309-4314). 
15 Photocleavable linkers are linkers that are cleaved upon exposure to light 

(see, e.g., Goldmacher et aL (1992) Bioconj. Chem. 3:104-107, which linkers 
are herein incorporated by reference), thereby releasing the targeted agent upon 
exposure to light. Photocleavable linkers that are cleaved upon exposure to light 
are known (see, e.g., Hazum et al. (1981) in Pept, Proc. Eur. Pept. Symp., 
20 16th, Brunfeldt, K (Ed), pp. 105-1 10, which describes the use of a nitrobenzyl 
group as a photocleavable protective group for cysteine; Yen et al. (1989) 
Makromol. Chem 790:69-82, which describes water soluble photocleavable 
copolymers, including hydroxypropylmethacrylamide copolymer, glycine 
copolymer, fluorescein copolymer and methylrhodamine copolymer; Gold- 
25 macher et al. (1992) Bioconj. Chem. 3:104-107, which describes a cross-linker 
and reagent that undergoes photolytic degradation upon exposure to near UV 
light (350 nm); and Senter et al. (1985) Photochem. Photobiol 42:231-237, 
which describes nitrobenzyioxycarbonyl chloride cross linking reagents that 
produce photocleavable linkages), thereby releasing the targeted agent upon 
30 exposure to light. Such linkers would have particular use in treating 

dermatological or ophthalmic conditions that can be exposed to light using fiber 
optics. After administration of the conjugate, the eye or skin or other body part 
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can be exposed to light, resulting in release of the targeted moiety from the 
conjugate.. Such photocleavable linkers are useful in connection with diagnostic 
protocols in which it is desirable to remove the targeting agent to permit rapid 
clearance from the body of the animal. 
5 b) Other linkers for chemical conjugation 

Other linkers, include trityl linkers, particularly, derivatized 
trityl groups to generate a genus of conjugates that provide for 
release of therapeutic agents at various degrees of acidity or alkalinity. 
The flexibility thus afforded by the ability to preselect the pH range at 

10 which the therapeutic agent is released allows selection of a linker based on the 
known physiological differences between tissues in need of delivery of a 
therapeutic agent (see, e.g., U.S. Patent No. 5,61 2,474). For example, the 
acidity of tumor tissues appears to be lower than that of normal tissues. 

c) Peptide linkers 

15 The linker moieties can be peptides. Peptide linkers can be employed in 

fusion proteins and also in chemically linked conjugates. The peptide typically 
has from about 2 to about 60 amino acid residues, for example from about 5 to 
about 40, or from about 10 to about 30 amino acid residues. The length 
selected depends upon factors, such as the use for which the linker is included. 

20 Peptide linkers are advantageous when the targeting agent is 

proteinaceous. For example, the linker moiety can be a flexible spacer amino 
acid sequence, such as those known in single-chain antibody research. 
Examples of such known linker moieties include, but are not limited to, 
peptides, such as (Gly m Ser) n and (Ser m Gly) n , in which n is 1 to 6, including 1 to 

25 4 and 2 to 4, and m is 1 to 6, including 1 to 4, and 2 to 4, enzyme cleavable 
linkers and others. 

Additional linking moieties are described, for example, in Huston etaL, 
Proc. Natl. Acad. ScL U.S.A. 55:5879-5883, 1988; Whitlow, M., etaL, Protein 
Engineering 6:989-995, 1993; Newton et al. t Biochemistry 35:545-553, 1996; 

30 A. J. Cumber etaL, Bioconj. Chem. 3:397-401, 1992; Ladurner etaL, J. MoL 
Biol. 273:330-337 , 1997; and U.S. Patent. No. 4,894,443. In some 
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embodiments, several linkers can be included in order to take advantage of 
desired properties of each linker. 

3. Targeting agents 

Any agent that facilitates detection, immobilization, or purification of the 
5 conjugate is contemplated for use herein. For chemical conjugates any moiety 
that has such properties is contemplated; for fusion proteins, the targeting agent 
is a protein, peptide or fragment thereof that is sufficient to effects the targeting 
activity. Contemplated targeting agents include those that deliver the MTSP25 
polypeptide or portion thereof to selected cells and tissues. Such agents include 
10 tumor specific monoclonal antibodies and portions thereof, growth factors, such 
as FGF, EGF, PDGF, VEGF, cytokines, including chemokines, and other such 
agents. 

4. Nucleic acids, plasmids and cells 

Isolated nucleic acid fragments encoding fusion proteins are provided. 

15 The nucleic acid fragment that encodes the fusion protein includes: a) nucleic 
acid encoding a protease domain of an MTSP25 polypeptide; and b) nucleic acid 
encoding a protein, peptide or effective fragment thereof that facilitates: i) 
affinity isolation or purification of the fusion protein; ii) attachment of the fusion 
protein to a surface; or iii) detection of the fusion protein. Generally, the nucleic 

20 acid is DNA. 

Plasmids for replication and vectors for expression that contain the above 
nucleic acid fragments are also provided. Cells containing the plasmids and 
vectors are also provided. The cells can be any suitable host including, but are 
not limited to, bacterial cells, yeast cells, fungal cells, plant cells, insect cell and 
25 animal cells. The nucleic acids, plasmids, and cells containing the plasmids can 
be prepared according to methods known in the art including any described 
herein. 

Also provided are methods for producing the above fusion proteins. An 
exemplary method includes the steps of growing, for example, culturing the cells 
30 so that they proliferate, ceils containing a plasmid encoding the fusion protein 
under conditions whereby the fusion protein is expressed by the cell, and 
recovering the expressed fusion protein. Methods for expressing and recovering 
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recombinant proteins are well known in the art (See generally, Current Protocols 
in Molecular Biology (1998) § 16, John Wiley & Sons, Inc.) and such methods 
can be used for expressing and recovering the expressed fusion proteins. 

The recovered fusion proteins can be isolated or purified by methods 
5 known in the art such as centrifugation, filtration, chromatography, 
electrophoresis, immunoprecipitation, and other such methods, or by a 
combination thereof (See generally, Current Protocols in Molecular Biology 
(1998) § 10, John Wiley & Sons, Inc.). Generally the recovered fusion protein is 
isolated or purified through affinity binding between the protein or peptide 
10 fragment of the fusion protein and an affinity binding moiety. As discussed in 
the above sections regarding the construction of the fusion proteins, any affinity 
binding pairs can be constructed and used in the isolation or purification of the 
fusion proteins. For example, the affinity binding pairs can be protein binding 
sequences/protein, DNA binding sequences/DNA sequences, RNA binding 
1 5 sequences/RNA sequences, lipid binding sequences/lipid, polysaccharide binding 
sequences/polysaccharide, or metal binding sequences/metal. 

5. Immobilization and supports or substrates therefor 
In certain embodiments, where the targeting agents are designed for 
linkage to surfaces, the MTSP25 polypeptide can be attached by linkage such as 
20 ionic or covalent, non-covalent or other chemical interaction, to a surface of a 
support or matrix material. Immobilization can be effected directly or via a 
linker. The MTSP25 polypeptide can be immobilized on any suitable support, 
including, but are not limited to, silicon chips, and other supports described 
herein and known to those of skill in the art. A plurality of MTSP25 polypeptide 
25 or protease domains thereof can be attached to a support, such as an array (i.e., 
a pattern of two or more) of conjugates on the surface of a silicon chip or other 
chip for use in high throughput protocols and formats. 

It is also noted that the domains of the MTSP25 polypeptide can be 
linked directly to the surface or via a linker without a targeting agent linked 
B0 thereto. Hence chips containing arrays of the domains of the MTSP25 
polypeptide are also provided. 
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The matrix material or solid supports contemplated herein are generally 
any of the insoluble materials known to those of skill in the art to immobilize 
ligands and other molecules, and are those that are used in many chemical 
syntheses and separations. Such supports are used, for example, in affinity 
5 chromatography, in the immobilization of biologically active materials, and during 
chemical syntheses of biomolecules, including proteins, amino acids and other 
organic molecules and polymers. The preparation of and use of supports is well 
known to those of skill in this art; there are many such materials and 
preparations thereof known. For example, naturally-occurring support materials, 
10 such as agarose and cellulose, can be isolated from their respective sources, and 
processed according to known protocols, and synthetic materials can be 
prepared in accord with known protocols. 

The supports are typically insoluble materials that are solid, porous, 
deformable, or hard, and have any required structure and geometry, including, 
15 but not limited to: beads, pellets, disks, capillaries, hollow fibers, needles, solid 
fibers, random shapes, thin films and membranes. Thus, the item can be 
fabricated from the matrix material or combined with it, such as by coating all or 
part of the surface or impregnating particles. 

Typically, when the matrix is particulate, the particles are at least about 
20 1 0-2000 //m, but can be smaller or larger, depending upon the selected 

application. Selection of the matrices is governed, at least in part, by their 
physical and chemical properties, such as solubility, functional groups, 
mechanical stability, surface area swelling propensity, hydrophobic or hydrophilic 
properties and intended use. 
25 If necessary, the support matrix material can be treated to contain an 

appropriate reactive moiety. In some cases, the support matrix material already 
containing the reactive moiety can be obtained commercially. The support 
matrix material containing the reactive moiety can thereby serve as the matrix 
support upon which molecules are linked. Materials containing reactive surface 
30 moieties such as amino silane linkages, hydroxyl linkages or carboxysilane 
linkages can be produced by well established surface chemistry techniques 
involving silanization reactions, or the like. Examples of these materials are 
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those having surface silicon oxide moieties, covalently linked to gamma-amino- 
propylsilane, and other organic moieties; N-[3-(triethyoxysilyl)propyl]phthelamic 
acid; and bis-(2-hydroxyethyl)aminopropyltriethoxysilane. Exemplary of readily 
available materials containing amino group reactive functionalities, include, but 
5 are not limited to, para-aminophenyltriethyoxysilane. Also derivatized 

polystyrenes and other such polymers are well known and readily available to 
those of skill in this art {e.g., the Tentagel® Resins are available with a multitude 
of functional groups, and are sold by Rapp Polymere, Tubingen, Germany; see, 
U.S. Patent No. 4,908,405 and U.S. Patent No. 5,292,814; see, also Butz et al., 
10 Peptide Res., 7:20-23 (1994); and Kleine et al., Immunobiol., 190:53-66 
(1994)). 

These matrix materials include any material that can act as a support 
matrix for attachment of the molecules of interest. Such materials are known to 
those of skill in this art, and include those that are used as a support matrix. 
15 These materials include, but are not limited to, inorganics, natural polymers, and 
synthetic polymers, including, but are not limited to: cellulose, cellulose 
derivatives, acrylic resins, glass, silica gels, polystyrene, gelatin, polyvinyl 
pyrrolidone, co-polymers of vinyl and acrylamide, polystyrene cross-linked with 
divinylbenzene and others {see, Merrifield, Biochemistry, 3:1385-1390 (1964)), 

20 polyacrylamides, latex gels, polystyrene, dextran, polyacrylamides, rubber, 

silicon, plastics, nitrocellulose, celluloses, natural sponges. Of particular interest 
herein, are highly porous glasses (see r e.g., U.S. Patent No. 4,244,721) and 
others prepared by mixing a borosilicate, alcohol and water. 

Synthetic supports include, but are not limited to: acrylamides, dextran- 

25 derivatives and dextran co-polymers, agarose-polyacrylamide blends, other 
polymers and co-polymers with various functional groups, methacrylate 
derivatives and co-polymers, polystyrene and polystyrene copolymers (see r e.g., 
Merrifield, Biochemistry, 3:1385-1390 (1964); Berg et al., in innovation 
Perspect. Solid Phase Synth. Collect. Pap., Int. Symp., 1st, Epton, Roger (Ed), 

30 pp. 453-459 (1990); Berg et al., Pept, Proc. Eur. Pept. Symp., 20th, Jung, G. 
et aL (Eds), pp. 196-198 (1989); Berg et al., J. Am. Chem. Soc, 
111:8024-8026 (1989); Kent et al., isr. J. Chem., 17:243-247 (1979); Kent et 
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al., J. Org. Chem., 43:2845-2852 (1978); Mitchell et al. # Tetrahedron Lett., 
42:3795-3798 (1976); U.S. Patent No. 4,507,230; U.S. Patent No. 4,006,117; 
and U.S. Patent No. 5,389,449). Such materials include those made from 
polymers and co-polymers such as polyvinylalcohols, acrylates and acrylic acids 
5 such as polyethylene-co-acrylic acid, polyethylene-co-methacrylic acid, polyethy- 
lene-co-ethylacrylate, polyethylene-co-methyl acrylate, polypropylene-co-acrylic 
acid, polypropylene-co-methyl-acrylic acid, polypropyiene-co-ethylacrylate, 
polypropylene-co-methyl acrylate, polyethylene-co-vinyl acetate, poly- 
propylene-co-vinyl acetate, and those containing acid anhydride groups such as 

10 polyethylene-co-maleic anhydride and polypropylene-co-maleic anhydride. 

Liposomes have also been used as solid supports for affinity purifications (Powell 
et al. BlotechnoL Bioeng., 33:173 (1989)). 

Numerous methods have been developed for the immobilization of 
proteins and other biomolecules onto solid or liquid supports (see, e.g., 

1 5 Mosbach, Methods in Enzymofogy, 44 (1 976); Weetall, Immobilized Enzymes, 
Antigens, Antibodies, and Peptides, (1975); Kennedy et al., Solid Phase 
Biochemistry, Analytical and Synthetic Aspects, Scouten, ed., pp. 253-391 
(1983); see, generally, Affinity Techniques. Enzyme Purification: Part B. 
Methods in Enzymofogy, Vol. 34, ed. W. B. Jakoby, M. Wilchek, Acad. Press, 

20 N.Y. (1974); and Immobilized Biochemicals and Affinity Chromatography, 

Advances In Experimental Medicine and Biology, vol. 42, ed. R. Dunlap, Plenum 
Press, N.Y. (1974)). 

Among the most commonly used methods are absorption and adsorption 
or covalent binding to the support, either directly or via a linker, such as the 

25 numerous disulfide linkages, thioether bonds, hindered disulfide bonds, and 

covalent bonds between free reactive groups, such as amine and thiol groups, 
known to those of skill in art (see, e.g., the PIERCE CATALOG, 
ImmunoTechnology Catalog & Handbook, 1992-1993, which describes the 
preparation of and use of such reagents and provides a commercial source for 

30 such reagents; Wong, Chemistry of Protein Conjugation and Cross Linking, CRC 
Press (1993); see also DeWitt et al., Proc. NatL Acad. Scl. U.S.A., 90:6909 
(1993); Zuckermann et aL, J. Am. Chem. Soc, 1 14 :10646 (1992); Kurth et al., 
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J. Am. Chem. Soc, 116:2661 (1994); Ellman et al., Proc. Natl. Acad. Set. 
U.S.A., 91:4708 (1994); Sucholeiki, Tetrahedron Lttrs., 35:7307 (1994); Su- 
Sun Wang, J. Org. Chem., 41:3258 (1976); Padwa et al., J. Org. Chem., 
41:3550 (1971); and Vedejs et al., J. Org. Chem., 49:575 (1984), which 
5 describe photosensitive linkers). 

To effect immobilization, a composition containing the protein or other 
biomolecule is contacted with a support material such as alumina, carbon, an 
ion-exchange resin, cellulose, glass or a ceramic. Fluorocarbon polymers have 
been used as supports to which biomolecules have been attached by adsorption 
10 (see, U.S. Patent No. 3,843,443; Published International PCT Application 
WO/86 03840). 
J. Prognosis and diagnosis 

MTSP25 polypeptide proteins, domains, analogs, and derivatives thereof, 
and encoding nucleic acids (and sequences complementary thereto), and anti- 
1 5 MTSP25 polypeptide antibodies, can be used in diagnostics, particularly 

diagnosis of lung, head and neck, such as esophageal tumors, prostate, colon, 
ovary, cervix, breast and pancreas cancers. Such molecules can be used in 
assays, such as immunoassays, to detect, prognose, diagnose, or monitor 
various conditions, diseases, and disorders affecting MTSP25 polypeptide 
20 expression, or monitor the treatment thereof. For purposes herein, the presence 
of MTSP25s in body fluids or tumor tissues are of particular interest. 

In particular, such an immunoassay is carried out by a method including 
contacting a sample derived from a patient with an anti-MTSP25 polypeptide 
antibody under conditions such that specific binding can occur, and detecting or 
25 measuring the amount of any specific binding by the antibody. Such binding of 
antibody, in tissue sections, can be used to detect aberrant MTSP25 polypeptide 
localization or aberrant (e.g., increased, decreased or absent) levels of MTSP25 
polypeptide. In a specific embodiment, antibody to an MTSP25 polypeptide can 
be used to assay in a patient tissue or body fluid, such as serum, sample for the 
30 presence of MTSP25 polypeptide where an aberrant level of MTSP25 
polypeptide is an indication of a diseased condition. 
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The immunoassays which can be used include but are not limited to 
competitive and non-competitive assay systems using techniques such as 
western blots, radioimmunoassays, ELISA (enzyme linked immunosorbent 
assay), "sandwich" immunoassays, immunoprecipitation assays, precipitin 
5 reactions, gel diffusion precipitin reactions, immunodiffusion assays, 

agglutination assays, complement-fixation assays, immunoradiometric assays, 
fluorescent immunoassays and protein A immunoassays. 

MTSP25 polypeptide genes and related nucleic acid sequences and 
subsequences, including complementary sequences, also can be used in 

10 hybridization assays. MTSP25 polypeptide nucleic acid sequences, or 

subsequences thereof containing about at least 8 nucleotides, generally 14 or 16 
or 30 or more, generally less than 1000 or up to 100, contiguous nucleotides 
can be used as hybridization probes; Hybridization assays can be used to 
detect, prognose, diagnose, or monitor conditions, disorders, or disease states 

15 associated with aberrant changes in MTSP25 polypeptide expression and/or 
activity as described herein. In particular, such a hybridization assay is carried 
out by a method by contacting a sample containing nucleic acid with a nucleic 
acid probe capable of hybridizing to MTSP25 polypeptide encoding DNA or RNA, 
under conditions such that hybridization can occur, and detecting or measuring 

20 any resulting hybridization. 

In a specific embodiment, a method of diagnosing a disease or disorder 
characterized by detecting an aberrant level of an MTSP25 polypeptide in a 
subject is provided herein by measuring the level of the DNA, RNA, protein or 
functional activity of the MTSP25 polypeptide in a sample derived from the 

25 subject, wherein an increase or decrease in the level of the DNA, RNA, protein or 
functional activity of the MTSP25 polypeptide, relative to the level of the DNA, 
RNA, protein or functional activity found in an analogous sample not having the 
disease or disorder indicates the presence of the disease or disorder in the 
subject. 

30 Kits for diagnostic use are also provided, that contain in one or more 

containers an anti-MTSP25 polypeptide antibody, and, optionally, a labeled 
binding partner to the antibody. Alternatively, the anti-MTSP25 polypeptide 
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antibody can be labeled (with a detectable marker, e.g., a chemiluminescent, 
enzymatic, fluorescent, or radioactive moiety). A kit is also provided that 
includes in one or more containers a nucleic acid probe capable of hybridizing to 
the MTSP25 polypeptide-encoding nucleic acid. In a specific embodiment, a kit 
5 can include in one or more containers a pair of primers (e.g., each in the size 
range of 6-30 nucleotides) that are capable of priming amplification [e.g., by 
polymerase chain reaction (see e.g., Innis et al., 1990, PCR Protocols, Academic 
Press, Inc., San Diego, CA), ligase chain reaction (see EP 320,308) use of Ofi 
replicase, cyclic probe reaction, or other methods known in the art under 
10 appropriate reaction conditions of at least a portion of an MTSP25 polypeptide- 
encoding nucleic acid. A kit can optionally further include in a container a 
predetermined amount of a purified MTSP25 polypeptide or nucleic acid, e.g., for 
use as a standard or control. 

K. Pharmaceutical compositions and modes of administration 

1 5 1 . Components of the compositions 

Pharmaceutical compositions containing the identified compounds that 
modulate the activity of an MTSP25 polypeptide are provided herein. Also 
provided are combinations of a compound that modulates the activity of an 
MTSP25 polypeptide and another treatment or compound for treatment of a 

20 neoplastic disorder, such as a chemotherapeutic compound. 

The MTSP25 polypeptide modulator and the anti-tumor agent can be 
packaged as separate compositions for administration together or sequentially or 
intermittently. Alternatively, they can provided as 

a single composition for administration or as two compositions for administration 
25 as a single composition. The combinations can be packaged as kits. 

a. MTSP25 polypeptide inhibitors 
Any MTSP25 polypeptide inhibitors, including those described herein 
when used alone or in combination with other compounds, that can alleviate, 
reduce, ameliorate, prevent, or place or maintain in a state of remission of 
30 clinical symptoms or diagnostic markers associated with neoplastic diseases, 

including undesired and/or uncontrolled angiogenesis, can be used in the present 
combinations. 
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In one embodiment, the MTSP25 polypeptide inhibitor is an antibody or 
fragment thereof that specifically reacts with an MTSP25 polypeptide or the 
protease domain thereof, an inhibitor of the MTSP25 polypeptide production, an 
inhibitor of MTSP25 polypeptide membrane-localization, or any inhibitor of the 
5 expression of or, especially, the activity of an MTSP25 polypeptide. 

b. Anti-angiogenic agents and anti-tumor agents 

Any anti-angiogenic agents and anti-tumor agents, including'those 
described herein, when used alone or in combination with other compounds, that 
can alleviate, reduce, ameliorate, prevent, or place or maintain in a state of 

10 remission of clinical symptoms or diagnostic markers associated with undesired 
and/or uncontrolled angiogenesis and/or tumor growth and metastasis, 
particularly solid neoplasms, vascular malformations and cardiovascular 
disorders, chronic inflammatory diseases and aberrant wound repairs, circulatory 
disorders, crest syndromes, dermatological disorders, or ocular disorders, can be 

15 used in the combinations. Also contemplated are anti-tumor agents for use in 
combination with an inhibitor of an MTSP25 polypeptide. ' 

c. Anti-tumor agents and anti-angiogenic agents 

The compounds identified by the methods provided herein or provided 
herein can be used in combination with anti-tumor agents and/or anti- 
20 angiogenesis agents. 

2. Formulations and route of administration 

The compounds herein and agents can be formulated as pharmaceutical 
compositions, typically for single dosage administration. The concentrations of 
the compounds in the formulations are effective for delivery of an amount, upon 

25 administration, that is effective for the intended treatment. Typically, the 

compositions are formulated for single dosage administration. To formulate a 
composition, the weight fraction of a compound or mixture thereof is dissolved, 
suspended, dispersed or otherwise mixed in a selected vehicle at an effective 
concentration such that the treated condition is relieved or ameliorated. 

30 Pharmaceutical carriers or vehicles suitable for administration of the compounds 
provided herein include any such carriers known to those skilled in the art to be 
suitable for the particular mode of administration. 
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In addition, the compounds can be formulated as the sole 
pharmaceutical^ active ingredient in the composition or can be combined with 
other active ingredients. Liposomal suspensions, including tissue-targeted 
liposomes, can also be suitable as pharmaceutical^ acceptable carriers. These 
5 can be prepared according to methods known to those skilled in the art. For 
example, liposome formulations can be prepared as described in U.S. Patent No. 
4,522,811. 

The active compound is included in the pharmaceutical^ acceptable 
carrier in an amount sufficient to exert a therapeutically useful effect in the 
10 absence of undesirable side effects on the patient treated. The therapeutically 
effective concentration can be determined empirically by testing the compounds 
in known jn vitro and in vivo systems, such as the assays provided herein. 

The concentration of active compound in the drug composition depends 
on absorption, inactivation and excretion rates of the active compound, the 
15 physicochemical characteristics of the compound, the dosage schedule, and 
amount administered as well as other factors known to those of skill in the art. 

Typically a therapeutically effective dosage is contemplated. The 
amounts administered can be on the order of 0.001 to 1 mg/ml, including about 
0.005-0.05 mg/ml and about 0.01 mg/ml, of blood volume. Pharmaceutical 
20 dosage unit forms are prepared to provide from about 1 mg to about 1000 mg, 
including from about 10 to about 500 mg, and including about 25-75 mg of the 
essential active ingredient or a combination of essential ingredients per dosage 
unit form. The precise dosage can be empirically determined. 

The active ingredient can be administered at once, or can be divided into 
25 a number of smaller doses to be administered at intervals of time. It is 

understood that the precise dosage and duration of treatment is a function of the 
disease being treated and can be determined empirically using known testing 
protocols or by extrapolation from ]n vivo or m vitro test data. It is to be noted 
that concentrations and dosage values can also vary with the severity of the 
30 condition to be alleviated. It is to be further understood that for any particular 
subject, specific dosage regimens should be adjusted over time according to the 
individual need and the professional judgment of the person administering or 
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supervising the administration of the compositions, and that the concentration 
ranges set forth herein are exemplary only and are not intended to limit the 
scope or use of the claimed compositions and combinations containing them. 
Pharmaceutically acceptable derivatives include acids, salts, esters, 
5 hydrates, solvates and prodrug forms. The derivative is typically selected such 
that its pharmacokinetic properties are superior to the corresponding neutral 
compound. 

Thus, effective concentrations or amounts of one or more of the 
compounds provided herein or pharmaceutically acceptable derivatives thereof 
1 0 : are mixed with a suitable pharmaceutical carrier or vehicle for systemic, topical 
or local administration to form pharmaceutical compositions. Compounds are 
included in an amount effective for ameliorating or treating the disorder for 
which treatment is contemplated. The concentration of active compound in the 
composition depends on absorption, inactivation, excretion rates of the active 
1 5 compound, the dosage schedule, amount administered, particular formulation as 
well as other factors known to those of skill in the art. 

Solutions or suspensions used for parenteral, intradermal, subcutaneous, 
or topical application can include any of the following components: a sterile 
diluent, such as water for injection, saline solution, fixed oil, polyethylene glycol, 
20 glycerine, propylene glycol or other synthetic solvent; antimicrobial agents, such 
as benzyl alcohol and methyl parabens; antioxidants, such as ascorbic acid and 
sodium bisulfite; chelating agents, such as ethylenediaminetetraacetic acid 
(EDTA); buffers, such as acetates, citrates and phosphates; and agents for the 
adjustment of tonicity such as sodium chloride or dextrose. Parenteral 
25 preparations can be enclosed in ampules, disposable syringes or single or 
multiple dose vials made of glass, plastic or other suitable material. 

In instances in which the compounds exhibit insufficient solubility, 
methods for solubilizing compounds can be used. Such methods are known to 
those of skill in this art, and include, but are not limited to, using cosolvents, 
30 such as dimethylsulfoxide (DMSO), using surfactants, such as Tween®, or 

dissolution in aqueous sodium bicarbonate. Derivatives of the compounds, such 
as prodrugs of the compounds can also be used in formulating effective 
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pharmaceutical compositions. For ophthalmic indications, the compositions are 
formulated in an ophthalmically acceptable carrier. For the ophthalmic uses 
herein, local administration, either by topical administration or by injection are 
contemplated. Time release formulations are also desirable. Typically, the 
5 compositions are formulated for single dosage administration, so that a single 
dose administers an effective amount. 

Upon mixing or addition of the compound with the vehicle, the resulting 
mixture can be a solution, suspension, emulsion or other composition. The form 
of the resulting mixture depends upon a number of factors, including the 
10 intended mode of administration and the solubility of the compound in the 

selected carrier or vehicle. If necessary, pharmaceutical^ acceptable salts or .. 
other derivatives of the compounds are prepared. 

The compound is included in the pharmaceutical^ acceptable carrier in an 
amount sufficient to exert a therapeutically useful effect in the absence of 
15 undesirable side effects on the patient treated. It is understood that number and 
degree of side effects depends upon the condition for which the compounds are 
administered. For example, certain toxic and undesirable side effects are 
tolerated when treating life-threatening illnesses that would not be tolerated 
when treating disorders of lesser consequence. 
20 The compounds also can be mixed with other active materials, that do 

not impair the desired action, or with materials that supplement the desired 
action known to those of skill in the art. The formulations of the compounds 
and agents for use herein include those suitable for oral, rectal, topical, 
inhalational, buccal (e.g., sublingual), parenteral (e.g., subcutaneous, 
25 intramuscular, intradermal, or intravenous), transdermal administration or any 
route. The most suitable route in any given case depends on the nature and 
severity of the condition being treated and on the nature of the particular active 
compound which is being used. The formulations are provided for administration 
to humans and animals in unit dosage forms, such as tablets, capsules, pills, 
30 powders, granules, sterile parenteral solutions or suspensions, and oral solutions 
or suspensions, and oil-water emulsions containing suitable quantities of the 
compounds or pharmaceutical^ acceptable derivatives thereof. The 
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pharmaceutically therapeutically active compounds and derivatives thereof are 
typically formulated and administered in unit-dosage forms or multiple-dosage 
forms. Unit-dose forms as used herein refers to physically discrete units suitable 
for human and animal subjects and packaged individually as is known in the art. 
5 Each unit-dose contains a predetermined quantity of the therapeutically active 
compound sufficient to produce the desired therapeutic effect, in association 
with the required pharmaceutical carrier, vehicle or diluent. Examples of 
unit-dose forms include ampoules and syringes and individually packaged tablets 
or capsules. Unit-dose forms can be administered in fractions or multiples 
10 thereof. A multiple-dose form is a plurality of identical unit-dosage forms 

packaged in a single container to be administered in segregated unit-dose form. 
Examples of multiple-dose forms include vials, bottles of tablets or capsules or 
bottles of pints or gallons. Hence, multiple dose form is a multiple of unit-doses 
which are not segregated in packaging. 
1 5 The composition can contain along with the active ingredient: a diluent 

such as lactose, sucrose, dicalcium phosphate, or carboxymethylcellulose; a 
lubricant, such as magnesium stearate, calcium stearate and talc; and a binder 
such as starch, natural gums, such as gum acacia, gelatin, glucose, molasses, 
polvinylpyrroiidine, celluloses and derivatives thereof, povidone, crospovidones 
20 and other such binders known to those of skill in the art. Liquid 

pharmaceutical^ administrate compositions can, for example, be prepared by 
dissolving, dispersing, or otherwise mixing an active compound as defined above 
and optional pharmaceutical adjuvants in a carrier, such as, for example, water, 
saline, aqueous dextrose, glycerol, glycols, ethanol, and the like, to thereby form 
25 a solution or suspension. If desired, the pharmaceutical composition to be 

administered can also contain minor amounts of nontoxic auxiliary substances 
such as wetting agents, emulsifying agents, or solubilizing agents, pH buffering 
agents and the like, for example, acetate, sodium citrate, cyclodextrine 
derivatives, sorbitan monolaurate, triethanolamine sodium acetate, 
30 triethanolamine oleate, and other such agents. Methods of preparing such 
dosage forms are known, or will be apparent, to those skilled in this art (see, 
e.g.. Remington's Pharmaceutical Sciences, Mack Publishing Company, Easton, 
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Pa., 15th Edition, 1975). The composition or formulation to be administered 
contains a quantity of the active compound in an amount sufficient to alleviate 
the symptoms of the treated subject. 

Dosage forms or compositions containing active ingredient in the range of 
5 0.005% to 100% with the balance made up from non-toxic carrier can be 

prepared. For oral administration, the pharmaceutical compositions can take the 
form of, for example, tablets or capsules prepared by conventional means with 
pharmaceutical^ acceptable excipients such as binding agents (e.g., 
pregelatinized maize starch, polyvinyl pyrrolidone or hydroxypropyl 
10 methylcellulose); fillers [e.g., lactose, microcrystalline cellulose or calcium 
hydrogen phosphate); lubricants {e.g., magnesium stearate, talc or silica); 
disintegrants (e.g., potato starch or sodium starch glycolate); or wetting agents 
(e.g., sodium lauryl sulphate). The tablets can be coated by methods well- 
known in the art. 

1 5 The pharmaceutical preparation can also be in liquid form, for example, 

solutions, syrups or suspensions, or can be presented as a drug product for 
reconstitution with water or other suitable vehicle before use. Such liquid 
preparations can be prepared by conventional means with pharmaceutical^ 
acceptable additives such as suspending agents (e.g., sorbitol syrup, cellulose 

20 derivatives or hydrogenated edible fats); emulsifying agents (e.g., lecithin or 
acacia); non-aqueous vehicles (e.g., almond oil, oily esters, or fractionated 
vegetable oils); and preservatives (e.g., methyl or propyl-p-hydroxybenzoates or 
sorbic acid). 

Formulations suitable for rectal administration can be presented as unit 
25 dose suppositories. These can be prepared by admixing the active compound 

with one or more conventional solid carriers, for example, cocoa butter, and then 
shaping the resulting mixture. 

Formulations suitable for topical application to the skin or to the eye 
generally are formulated as an ointment, cream, lotion, paste, gel, spray, aerosol 
30 and oil. Carriers which can be used include vaseline, lanoline, polyethylene 
glycols, alcohols, and combinations of two or more thereof. The topical 
formulations can further advantageously contain 0.05 to 15 percent by weight 
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of thickeners selected from among hydroxypropyl methyl cellulose, methyl 
cellulose, polyvinylpyrrolidone, polyvinyl alcohol, poly (alkyiene glycols), 
poly/hydroxyalkyl, (meth)acrylates or poly(meth)acrylamides. A topical 
formulation is often applied by instillation or as an ointment into the conjunctival 
5 sac. It also can be used for Irrigation or lubrication of the eye, facial sinuses, 
and external auditory meatus. It can also be injected into the anterior eye 
chamber and other places. The topical formulations in the liquid state can be 
also present in a hydrophiiic three-dimensional polymer matrix in the form of a 
strip, contact lens, and the like from which the active components are released. 
10 For administration by inhalation, the compounds for use herein can be 

delivered in the form of an aerosol spray presentation from pressurized packs or 
a nebulizer, with the use of a suitable propellant, e.g., dichlorodifluoromethane, 
trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide or other 
suitable gas. In the case of a pressurized aerosol, the dosage unit can be 
1 5 determined by providing a valve to deliver a metered amount. Capsules and 
cartridges of, e.g., gelatin, for use in an inhaler or insufflator can be formulated 
containing a powder mix of the compound and a suitable powder base such as 
lactose or starch. 

Formulations suitable for buccal (sublingual) administration include, for 
20 example, lozenges containing the active compound in a flavored base, usually 
• sucrose and acacia or tragacanth; and pastilles containing the compound in an - 
inert base such as gelatin and glycerin or sucrose and acacia. 

The compounds can be formulated for parenteral administration by 
injection, e.g., by bolus injection or continuous infusion. Formulations for 
25 injection can be presented in unit dosage form, e.g., in ampules or in multi-dose 
containers, with an added preservative. The compositions can be suspensions, 
solutions or emulsions in oily or aqueous vehicles, and can contain formulatory 
agents such as suspending, stabilizing and/or dispersing agents. Alternatively, 
the active ingredient can be in powder form for reconstitution with a suitable 
30 vehicle, e.g., sterile pyrogen-free water or other solvents, before use. 

Formulations suitable for transdermal administration can be presented as 
discrete patches adapted to remain in intimate contact with the epidermis of the 
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recipient for a prolonged period of time. Such patches suitably contain the 
active compound as an optionally buffered aqueous solution of, for example, 0,1 
to 0.2 M concentration with respect to the active compound. Formulations 
suitable for transdermal administration can also be delivered by iontophoresis 
5 (see, e.g., Pharmaceutical Research 3 (6), 318 (1986)) and typically take the 

form of an optionally buffered aqueous solution of the active compound. 

> 

The pharmaceutical compositions can also be administered by controlled 
release means and/or delivery devices (see, e.g., in U.S. Patent Nos. 3,536,809; 
3,598,123; 3,630,200; 3,845,770; 3,847,770; 3,916,899; 4,008,719; 

10 4,687,610; 4,769,027; 5,059,595; 5,073,543; 5,120,548; 5,354,566; 
5,591,767; 5,639,476; 5,674,533 and 5,733,566). 

Desirable blood levels can be maintained by a continuous infusion of the 
active agent as ascertained by plasma levels. It should be noted that the 
attending physician would know how to and when to terminate, interrupt or 

1 5 adjust therapy to lower dosage due to toxicity, or bone marrow, liver or kidney 
dysfunctions. Conversely, the attending physician would also know how to and 
when to adjust treatment to higher levels if the clinical response is not adequate 
(precluding toxic side effects). 

The efficacy and/or toxicity of the MTSP25 polypeptide inhibitor(s), alone 
20 or in combination with other agents also can be assessed by the methods known 
in the art (See generally, O'Reilly, Investigational New Drugs, J_5:5-13 (1997)). 

The active compounds or pharmaceutically acceptable derivatives can be 
prepared with carriers that protect the compound against rapid elimination from 
the body, such as time release formulations or coatings. 
25 Kits containing the compositions and/or the combinations with 

instructions for administration thereof are provided. The kit can further include a 
needle or syringe, typically packaged in sterile form, for injecting the complex, 
and/or a packaged alcohol pad. Instructions are optionally included for 
administration of the active agent by a clinician or by the patient. 
30 Finally, the compounds or MTSP25 polypeptides or protease domains 

thereof or compositions containing any of the preceding agents can be packaged 
as articles of manufacture containing packaging material, a compound or suitable 
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derivative thereof provided herein, which is effective for treatment of a diseases 
or disorders contemplated herein, within the packaging material, and a label that 
indicates that the compound or a suitable derivative thereof is for treating the 
diseases or disorders contemplated herein. The label can optionally include the 
5 disorders for which the therapy is warranted. 
L. Methods of treatment 

The compounds identified by the methods herein are used for treating or 
preventing neoplastic diseases in an animal, particularly a mammal, including a 
human, is provided herein. In one embodiment, the method includes 
10 administering to a mammal an effective amount of an inhibitor of an MTSP25 
polypeptide, whereby the disease or disorder is treated or prevented. 

In an embodiment, the MTSP25 polypeptide inhibitor used in the 
treatment or prevention is administered with a pharmaceutical^ acceptable 
carrier or excipient. The mammal treated can be a human. The inhibitors 
15 provided herein are those identified by the screening assays. In addition, 

antibodies and antisense nucleic acids or double-stranded RNA (dsRNA), such as 
RNAi, are contemplated. 

The treatment or prevention method can further include administering an 
anti-angiogenic treatment or agent or anti-tumor agent simultaneously with, prior 

20 to or subsequent to the MTSP25 polypeptide inhibitor, which can be any 

compound identified that inhibits the activity of an MTSP25 polypeptide. Such 
compounds include small molecule modulators, an antibody or a fragment or 
derivative thereof containing a binding region thereof against the MTSP25 
polypeptide, an antisense nucleic acid or double-stranded RNA (dsRNA), such as 

25 RNAi, encoding an a portion of the MTSP25 polypeptide or complementary 

thereto, and a nucleic acid containing at least a portion of a gene encoding the 
MTSP25 polypeptide into which a heterologous nucleotide sequence has been 
inserted such that the heterologous sequence inactivates the biological activity 
of at least a portion of the gene encoding the MTSP25 polypeptide, in which the 
30 portion of the gene encoding the MTSP25 polypeptide flanks the heterologous 
sequence to promote homologous recombination with a genomic gene encoding 
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the MTSP25 polypeptide. In addition, such molecules are generally less than 
about 1000 nt long. 

1 . Antisense treatment 

In a specific embodiment, as described hereinabove, MTSP25 polypeptide 
5 function is reduced or inhibited by MTSP25 polypeptide antisense nucleic acids, 
to treat or prevent neoplastic disease. The therapeutic or prophylactic use of 
nucleic acids of at least six nucleotides, generally up to about 1 50 nucleotides, 
that are antisense to a gene or cDNA encoding MTSP25 polypeptide or a portion 
thereof is provided. An MTSP25 polypeptide "antisense" nucleic acid as used 

10 herein refers to a nucleic acid capable of hybridizing to a portion of an MTSP25 
polypeptide RNA (generally mRNA) by virtue of some sequence complementarity, 
and generally under high stringency conditions. The antisense nucleic acid can 
be complementary to a coding and/or noncoding region of an MTSP25 
polypeptide mRNA. Such antisense nucleic acids have utility as therapeutics 

15 that reduce or inhibit MTSP25 polypeptide function, and can be used in the 
treatment or prevention of disorders as described supra. 

The MTSP25 polypeptide antisense nucleic acids are of at least six 
nucleotides and are generally oligonucleotides (ranging from 6 to about 150 

nucleotides including 6 to 50 nucleotides). The antisense molecule can be 

» 

20 complementary to all or a portion of the protease domain. For example, the 
oligonucleotide is at least 6 nucleotides, at least 10 nucleotides, at least 15 
nucleotides, at least 100 nucleotides, or at least 125 nucleotides. The 
oligonucleotides can be DNA or RNA or chimeric mixtures or derivatives or 
modified versions thereof, single-stranded or double-stranded. The 

25 oligonucleotide can be modified at the base moiety, sugar moiety, or phosphate 
backbone. The oligonucleotide can include other appending groups such as 
peptides, or agents facilitating transport across the cell membrane (see, e.g., 
Letsinger et al„ Proc. Natl. Acad. Sci. U.S.A. 86:6553-6556 (1989); Lemaitre et 
al., Proc. Natl. Acad. Sci. U.S.A. 84:648-652 (1987); PCT Publication No. 

30 WO 88/09810, published December 15, 1988) or blood-brain barrier (see, e.g., 
PCT Publication No. WO 89/10134, published April 25, 1988), hybridization- 
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triggered cleavage agents {see, e.g., Krol et al., BioTechniques 6:958-976 
(1988)) or intercalating agents (see, e.g., Zon, Pharm. Res. 5:539-549 (1988)). 

The MTSP25 polypeptide antisense nucleic acid generally is an oligo- 
nucleotide, typically single-stranded DNA or RNA or an analog thereof or 
5 mixtures thereof. For example, the oligonucleotide includes a sequence 
antisense to a portion of a nucleic acid that encodes a human MTSP25 
polypeptide. The oligonucleotide can be modified at any position on its structure 
with substituents generally known in the art. 

The MTSP25 polypeptide antisense oligonucleotide can include at least 
10 one modified base moiety which is selected from the group including, but not 
limited to, 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, 
hypoxanthine, xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyI) uracil, 
5-carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil, 
dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 
15 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 
2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 
7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl- 
2-thiouracil, beta-D-mannosylqueosine, 5 '-methoxycarboxymethyluracil, 
5-methoxyuracil, 2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), 
20 wybutoxosine, pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 
2-thiouracil, 4-thiouracil, 5-methyIuracil, uracil-5-oxyacetic acid methylester, 
uracil-5-oxyacetic acid (v), 5-methyl- 2-thiouracil, 3-(3-amino-3-N-2- 
carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. 

In another embodiment, the oligonucleotide includes at least one modified 
25 sugar moiety selected from the group including but not limited to arabinose, 

2-fluoroarabinose, xylulose, and hexose. The oligonucleotide can include at least 
one modified phosphate backbone selected from a phosphorothioate, a 
phosphorodithioate, a phosphoramidothioate, a phosphoramidate, a 
phosphordiamidate, a methylphosphonate, an alkyl phosphotriester, and a 
30 formacetal or analog thereof. 

The oligonucleotide can be an a-anomeric oligonucleotide. An a-anomeric 
oligonucleotide forms specific double-stranded hybrids with complementary RNA 
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in which the strands run parallel to each other (Gautier et al., Nucl. Acids Res. 
15:6625-6641 (1987)). 

The oligonucleotide can be conjugated to another molecule, such as, but 
are not limited to, a peptide, hybridization triggered cross-linking agent, transport 
5 agent or a hybridization-triggered cleavage agent. The oligonucleotides can be 

♦ 

synthesized by standard methods known in the art, e.g. by use of an automated 
DNA synthesizer (such as are commercially available from Biosearch, Applied 
Biosystems, etc.). As examples, phosphorothioate oligonucleotides can be 
synthesized by the method of Stein et al. (Nucl. Acids Res. 16:3209 (1988)), 
10 methylphosphonate oligonucleotides can be prepared by use of controlled pore 
glass polymer supports (Sarin et al., Proc. Natl. Acad. Sci. U.S.A. 85:7448-7451 
(1988)), etc. 

In a specific embodiment, the MTSP25 polypeptide antisense 
oligonucleotide includes catalytic RNA or a ribozyme (see, e.g., PCT International 

1 5 Publication WO 90/1 1 364, published October 4, 1 990; Sarver et al., Science 
247:1222-1225 (1990)). In another embodiment, the oligonucleotide is a 2'-0- 
methylribonucleotide (Inoue et al., Nucl. Acids Res. 15:6131-6148 (1987)), or a 
chimeric RNA-DNA analogue (Inoue et al., FEBS Lett. 215 :327-330 (1987)). 
Alternatively, the oligonucleotide can be double-stranded RNA (dsRNA) such as 

20 RNAi. . 

In an alternative embodiment, the MTSP25 polypeptide antisense nucleic 
acid is produced intracellularly by transcription from an exogenous sequence. 
For example, a vector can be introduced in vivo such that it is taken up by a cell, 
within which cell the vector or a portion thereof is transcribed, producing an 

25 antisense nucleic acid (RNA). Such a vector would contain a sequence encoding 
the MTSP25 polypeptide antisense nucleic acid. Such a vector can remain 
episomal or become chromosomally integrated, as long as it can be transcribed 
to produce the desired antisense RNA. Such vectors can be constructed by 
recombinant DNA technology methods standard in the art. Vectors can be 

30 plasmid, viral, or others known in the art, used for replication and expression in 
mammalian cells. Expression of the sequence encoding the MTSP25 polypeptide 
antisense RNA can be by any promoter known in the art to act in mammalian, 
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including human, cells. Such promoters can be inducible or constitutive. Such 
promoters include but are not limited to: the SV40 early promoter region 
(Bernoist and Chambon, Nature 290 :304-310 (1981), the promoter contained in 
the 3' long terminal repeat of Rous sarcoma virus (Yamamoto et al., Cell 22:787- 
5 797 (1980), the herpes thymidine kinase promoter (Wagner et al. r Proc. Natl. 
Acad. Sci. U.S.A. 78:1441-1445 (1981), the regulatory sequences of the 
metallothionein gene (Brinster et aL, Nature 296 :39-42 (1982), etc. 

The antisense nucleic acids include sequence complementary to at least a 
portion of an RNA transcript of an MTSP25 polypeptide gene, including a human 
10 MTSP25 polypeptide gene. Absolute complementarily is not required. 

The amount of MTSP25 polypeptide antisense nucleic acid that is 
effective in the treatment or prevention of neoplastic disease depends on the 
nature of the disease, and can be determined empirically by standard clinical 
techniques. Where possible, it is desirable to determine the antisense 
1 5 cytotoxicity in cells in vitro, and then in useful animal model systems prior to 
testing and use in humans. 

2. RNA interference 

RNA interference (RNAi) (see, e.g. Chuang et al. (2000) Proc. Natl. Acad. 
Sci. U.S.A. 57:4985) can be employed to inhibit the expression of a gene 
20 encoding an MTSP25. Interfering RNA (RNAi) fragments, particularly double- 
stranded (ds) RNAi, can be used to generate loss-of-MTSP25 function. Methods 
relating to the use of RNAi to silence genes in organisms including, mammals, C. 
elegans, Drosophila and plants, and humans are known (see, e.g., Fire et al. 

(1998) Nature 357:806-811 Fire (1999) Trends Genet. 75:358-363; Sharp 
25 (2001 ) Genes Dev. 75:485-490; Hammond, et al. (2001 ) Nature Rev. 

Genet. 2:1 10-1 119; Tuschl (2001) Chem. Biochem. 2:239-245; Hamilton et al. 

(1999) Science 255:950-952; Hammond et al. (2000) Nature 404:293-296; 
Zamore et al. (2000) Cell 707:25-33; Bernstein et al. (2001) Nature 409: 363- 
366; Elbashir et al. (2001) Genes Dev. 75:188-200; Elbashir et al. (2001) Nature 

30 47 7:494-498; International PCT application No. WO 01/29058; International 
PCT application No. WO 99/32619). 
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Double-stranded RNA (dsRNA)-expressing constructs are introduced into 
a host, such as an animal or plant using, a replicable vector that remains 
episomal or integrates into the genome. By selecting appropriate sequences, 
expression of dsRNA can interfere with accumulation of endogenous mRNA 
5 encoding an MTSP25. RNAi also can be used to inhibit expression in vitro or in 
vivo. Regions include at least about 21 (or 21) nucleotides that are selective 
(i.e. unique) for MTSP25 are used to prepare the RNAi. Smaller fragments of 
about 21 nucleotides can be transformed directly [i.e., in vitro or in vivo) into 
cells; larger RNAi dsRNA molecules are generally introduced using vectors that 
10 encode them. dsRNA molecules are at least about 21 bp long or longer, such 
as 50, 100, 150, 200 and longer. Methods, reagents and protocols for 
introducing nucleic acid molecules in to cells in vitro and in vivo are known to 
those of skill in the art. 

3. Gene Therapy 
15 ln an exemplary embodiment, nucleic acids that include a sequence of 

nucleotides encoding an MTSP25 polypeptide or functional domains or derivative 
thereof, are administered to promote MTSP25 polypeptide function, by way of 
gene therapy. Gene therapy refers to therapy performed by the administration of 
a nucleic acid to a subject. In this embodiment, the nucleic acid produces its 
20 encoded protein that mediates a therapeutic effect by promoting MTSP25 

polypeptide function. Any of the methods for gene therapy available in the art 
can be used (see, Goldspiel et al., Clinical Pharmacy '12:488-505 (1993); Wu 
and Wu, Biotherapy 3:87-95 (1991); Tolstoshev, An. Rev. Pharmacol. Toxicol. 
32:573-596 (1993); Mulligan, Science 260:926-932 (1993); and Morgan and 
25 Anderson, An. Rev. Biochem. 62:191-217 (1993); TIBTECH 1 1 (5) :1 55-21 5 
(1993). 

For example, one therapeutic composition for gene therapy includes an 
MTSP25 polypeptide-encoding nucleic acid that is part of an expression vector 
that expresses an MTSP25 polypeptide or domain, fragment or chimeric protein 
30 thereof in a suitable host. In particular, such a nucleic acid has a promoter 
operably linked to the MTSP25 polypeptide coding region, the promoter being 
inducible or constitutive, and, optionally, tissue-specific. In another particular 



WO 03/031585 



PCT7US02/32417 



-142- 

embodiment, a nucleic acid molecule is used in which the MTSP25 polypeptide 
coding sequences and any other desired sequences are flanked by regions that 
promote homologous recombination at a desired site in the genome, thus 
providing for intrachromosomal expression of the SP peptide nucleic acid (Koller 
5 and Smithies, Proc. Natl. Acad. Sci. USA 86:8932-8935 (1989); Zijlstra et al. f 
Nature 342 :435-438 (1989)). 

Delivery of the nucleic acid into a patient can be either direct, in which 
case the patient is directly exposed to the nucleic acid or nucleic acid-carrying 
vector, or indirect, in which case, cells are first transformed with the nucleic acid 
1 0 in vitro, then transplanted into the patient. These two approaches are known, 
respectively, as in vivo or ex vivo gene therapy. 

In a specific embodiment, the nucleic acid is directly administered in vivo t 
where it is expressed to produce the encoded product. This can be 
accomplished by any of numerous methods known in the art, e.g., by 
1 5 constructing it as part of an appropriate nucleic acid expression vector and 
administering it so that it becomes intracellular, e.g., by infection using a 
defective or attenuated retroviral or other viral vector (see U.S. Patent No. 
4,980,286), or by direct injection of naked DNA, or by use of microparticle 
bombardment [e.g., a gene gun; Bioiistic, Dupont), or coating with lipids or cell- 
20 surface receptors or transfecting agents, encapsulation in liposomes, 

microparticles, or microcapsules, or by administering it in linkage to a peptide 
which is known to enter the nucleus, by administering it in linkage to a ligand 
subject to receptor-mediated endocytosis (see e.g., Wu and Wu, J. Biol. Chem. 
262:4429-4432 (1987)) (which can be used to target cell types specifically 
25 expressing the receptors), etc. In another embodiment, a nucleic acid-ligand 

complex can be formed in which the ligand is a fusogenic viral peptide to disrupt 
endosomes, allowing the nucleic acid to avoid lysosomal degradation. In yet 
another embodiment, the nucleic acid can be targeted in vivo for cell specific 
uptake and expression, by targeting a specific receptor (see, e.g., PCT 
30 Publications WO 92/06180 dated April 16, 1992 (Wu et al.); WO 92/22635 
dated December 23, 1992 (Wilson et al.); WO92/20316 dated November 26, 
1992 (Findeis et ai.); W093/14188 dated July 22, 1993 (Clarke et al.), WO 
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93/20221 dated October 14, 1993 (Young)). Alternatively, the nucleic acid can 
be introduced intracellular^ and incorporated within host cell DNA for 
expression, by homologous recombination (Koller and Smithies, Proc. Natl. Acad. 
Set. USA. 86:8932-8935 (1989); Zijlstra et aL, Nature 342:435-438 (1989)). 
5 In a specific embodiment, a viral vector that contains the MTSP25 

polypeptide nucleic acid is used. For example, a retroviral vector can be used 
(see Miller et al., Meth. Enzymol. 217 :581-599 (1993)). These retroviral vectors 
have been modified to delete retroviral sequences that are not necessary for 
packaging of the viral genome and integration into host cell DNA. The MTSP25 

10 polypeptide nucleic acid to be used in gene therapy is cloned into the vector, 
which facilitates delivery of the gene into a patient. More detail about retroviral 
vectors can be found in Boesen et al., Biotherapy 6:291-302 (1994), which 
describes the use of a retroviral vector to deliver the mdr1 gene to hematopoietic 
stem cells in order to make the stem cells more resistant to chemotherapy. 

15 Other references illustrating the use of retroviral vectors in gene therapy are: 
Clowes et al., J. Clin. Invest. 93:644-651 (1994); Kiem et al., Blood 83:1467- 
1473 (1994); Salmons and Gunzberg, Human Gene Therapy 4:129-141 (1993); 
and Grossman and Wilson, Curr. Opin. in Genetics and DeveL 3:1 10-1 14 
(1993). 

20 Adenoviruses are other viral vectors that can be used in gene therapy. 

Adenoviruses are especially attractive vehicles for delivering genes to respiratory 
epithelia. Adenoviruses naturally infect respiratory epithelia where they cause a 
mild disease. Other targets for adenovirus-based delivery systems are liver, the 
central nervous system, endothelial cells, and muscle. Adenoviruses have the 

25 advantage of being capable of infecting non-dividing cells. Kozarsky and Wilson, 
Current Opinion in Genetics and Development 3:499-503 (1993) present a 
review of adenovirus-based gene therapy. Bout et al., Human Gene Therapy 
5.:3-10 (1 994) demonstrated the use of adenovirus vectors to transfer genes to 
the respiratory epithelia of rhesus monkeys. Other instances of the use of 

30 adenoviruses in gene therapy can be found in Rosenfeld et al., Science 252 :431- 
434 (1991); Rosenfeld et al., Cell 68:143-1 55 (1992); and Mastrangeli et al., J. 
Clin, invest 91:225-234 (1993). 
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Adeno-associated virus (AAV) has also been proposed for use in gene 
therapy (Walsh et aL, Proc. Soc. Exp. Biol. Med. 204 :289-300 (1993). 

Another approach to gene therapy involves transferring a gene to cells in 
tissue culture by such methods as electroporation, lipofection, calcium 
5 phosphate mediated transfection, or viral infection. Usually, the method of 
transfer includes the transfer of a selectable marker to the cells. The cells are 
then placed under selection to isolate those cells that have taken up and are 
expressing the transferred gene. Those cells are then delivered to a patient. 
: In this embodiment, the nucleic acid is introduced into a cell prior to 

10 administration in vivo of the resulting recombinant ceil. Such introduction can 
be carried out by any method known in the art, including but not limited to 
transfection, electroporation, microinjection, infection with a viral or 
bacteriophage vector containing the nucleic acid sequences, cell fusion, 
chromosome-mediated gene transfer, microcell-mediated gene transfer, 

1 5 spheroplast fusion, etc. Numerous techniques are known in the art for the 
introduction of foreign genes into cells (see e.g. , Loeffler and Behr, Meth. 
Enzymol. 217 :599-618 (1993); Cohen et al., Meth. EnzymoL 217 :618-644 
(1993); Cline, Pharmac. Ther. 29:69-92 (1985)) and can be used, provided that 
the necessary developmental and physiological functions of the recipient cells 

20 are not disrupted. The technique should provide for the stable transfer of the 
nucleic acid to the cell, so that the nucleic acid is expressible by the cell and 
generally heritable and expressible by its cell progeny. 

The resulting recombinant cells can be delivered to a patient by various 
methods known in the art. In an embodiment, epithelial cells are injected, e.g., 

25 subcutaneously. In another embodiment, recombinant skin cells can be applied 
as a. skin graft onto the patient. Recombinant blood cells (e.g., hematopoietic 
stem or progenitor cells) can be administered intravenously. The amount of cells 
envisioned for use depends on the desired effect, patient state, etc., and can be 
determined by one skilled in the art. 

30 Cells into which a nucleic acid can be introduced for purposes of gene 

therapy encompass any desired, available cell type, and include but are not 
limited to epithelial cells, endothelial cells, keratinocytes, fibroblasts, muscle 
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cells, hepatocytes; blood cells such as T lymphocytes, B lymphocytes, 
monocytes, macrophages, neutrophils, eosinophils, megakaryocytes, 
granulocytes; various stem or progenitor cells, in particular hematopoietic stem 
or progenitor cells, e.g., such as stem cells obtained from bone marrow, 
5 umbilical cord blood, peripheral blood, fetal liver, and other sources thereof. 

For example, a cell used for gene therapy is autologous to the patient. In 
an embodiment in which recombinant cells are used in gene therapy, an MTSP25 
polypeptide nucleic acid is introduced into the cells such that it is expressible by 
the cells or their progeny, and the recombinant cells are then administered in 

10 vivo for therapeutic effect. In a specific embodiment, stem or progenitor cells 
are used. Any stem and/or progenitor cells which can be isolated and 
maintained in vitro can potentially be used in accordance with this embodiment. 
Such stem cells include but are not limited to hematopoietic stem cells (HSC), 
stem cells of epithelial tissues such as the skin and the lining of the gut, 

15 embryonic heart muscle cells, liver stem cells (PCT Publication WO 94/08598, 
dated April 28, 1994), and neural stem cells (Stemple and Anderson, Cell 
71:973-985 (1992)). 

Epithelial stem cells (ESCs) or keratinocytes can be obtained from tissues 
such as the skin and the lining of the gut by known procedures (Rheinwald, 

20 Meth. Cell Bio. 27/4:229 (1980)). In stratified epithelial tissue such as the skin, 
renewal occurs by mitosis of stem cells within the germinal layer, the layer 
closest to the basal lamina. Stem cells within the lining of the gut provide for a 
rapid renewal rate of this tissue. ESCs or keratinocytes obtained from the skin 
or lining of the gut of a patient or donor can be grown in tissue culture 

25 (Rheinwald, Meth. Cell Bio. 21A:229 (1980); Pittelkow and Scott, Cano Clinic 
Proc. Sliiy) (1986)). If the ESCs are provided by a donor, a method for 
suppression of host versus graft reactivity {e.g., irradiation, drug or antibody 
administration to promote moderate immunosuppression) also can be used. 
With respect to hematopoietic stem cells (HSC), any technique which 

30 provides for the isolation, propagation, and maintenance in vitro of HSC can be 
used in this embodiment. Techniques by which this can be accomplished 
include (a) the isolation and establishment of HSC cultures from bone marrow 
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cells isolated from the future host, or a donor, or (b) the use of previously 
established long-term HSC cultures, which can be allogeneic or xenogeneic. 
Non-autologous HSC generally are used with a method of suppressing 
transplantation immune reactions of the future host/patient. In a particular 
5 embodiment, human bone marrow cells can be obtained from the posterior iliac 
crest by needle aspiration (see, e.g., Kodo et al., J. Clin. Invest. 75:1377-1384 
(1984)). For example, the HSCs can be made highly enriched or in substantially 
pure form. This enrichment can be accomplished before, during, or after long- 
term culturing, and can be done by any techniques known in the art. Long-term 
10 cultures of bone marrow cells can be established and maintained by using, for 
example, modified Dexter cell culture techniques (Dexter et al., J. Cell Physiol. 
S/:335 (1977) or Witlock-Witte culture techniques (Witlock and Witte, Proc. 
Natl. Acad. Sci. USA 73:3608-3612 (1982)). 

In a specific embodiment, the nucleic acid to be introduced for purposes 
15 of gene therapy includes an inducible promoter operably linked to the coding 

region, such that expression of the nucleic acid is controllable by controlling the 
presence or absence of the appropriate inducer of transcription. 

3. Prodrugs 

A method for treating tumors is provided. The method is practiced by 
20 administering a prodrug that is cleaved at a specific site by an MTSP25 to 

release an active drug or precursor that can be converted to active drug in vivo. 
Upon contact with a cell that expresses MTSP25 activity, the prodrug is 
converted into an active drug. The prodrug can be a conjugate that contains the 
active agent, such as an anti-tumor drug, such as a cytotoxic agent, or other 
25 therapeutic agent (TA), linked to a substrate for the targeted MTSP25, such that 
the drug or agent is inactive or unable to enter a cell, in the conjugate, but is 
activated upon cleavage. The prodrug, for example, can contain an oligopeptide, 
typically a relatively short, less than about 10 amino acids peptide, that is 
proteolytically cleaved by the targeted MTSP25. Cytotoxic agents, include, but 
30 are not limited to, alkylating agents, antiproliferative agents and tubujin binding 
agents. Others include, vinca drugs, mitomycins, bleomycins and taxanes. 
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M. Animal models 

Transgenic animal models and animals, such as rodents, including mice 
and rats, cows, chickens, pigs, goats, sheep, monkeys, including gorillas, and 
other primates, are provided herein. In particular, transgenic non-human animals 
5 that contain heterologous nucleic acid encoding an MTSP25 polypeptide or a 
transgenic animal in which expression of the polypeptide has been altered, such 
as by replacing or modifying the promoter region or other regulatory region of 
the endogenous gene are provided. Such an animal can by produced by 
promoting recombination between endogenous nucleic acid and an exogenous 
10 MTSP25 gene that could be over-expressed or mis-expressed, such as by 
expression under a strong promoter, via homologous or other recombination 
event. 

Transgenic animals can be produced by introducing the nucleic acid using 
any known method of delivery, including, but not limited to, microinjection, 
1 5 lipofection and other modes of gene delivery into a germline cell or somatic cells, 
such as an embryonic stem cell. Typically the nucleic acid is introduced into a 
cell, such as an embryonic stem cell (ES), followed by injecting the ES cells into 
a blastocyst, and implanting the blastocyst into a foster mother, which is 
followed by the birth of a transgenic animal. Generally, introduction of a 

i 

20 heterologous nucleic acid molecule into a chromosome of the animal occurs by a 
recombination between the heterologous MTSP25-encoding nucleic acid and 
endogenous nucleic acid. The heterologous nucleic acid can be targeted to a 
specific chromosome. 

In some instances, knockout animals can be produced. Such an animal 

25 can be initially produced by promoting homologous recombination between an 
MTSP25 polypeptide gene in its chromosome and an exogenous MTSP25 
1 polypeptide gene that has been rendered biologically inactive (typically by 

insertion of a heterologous sequence, e.g., an antibiotic resistance gene). In one 
embodiment, this homologous recombination is performed by transforming 

30 embryo-derived stem (ES) cells with a vector containing the insertionally 

inactivated MTSP25 polypeptide gene, such that homologous recombination 
occurs, followed by injecting the ES ceils into a blastocyst, and implanting the 
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blastocyst into a foster mother, followed by the birth of the chimeric animal 
("knockout animal") in which an MTSP25 polypeptide gene has been inactivated 
(see Capecchi, Science 244 :1288-1 292 (1989)). The chimeric animal can be 
bred to produce homozygous knockout animals, which can then be used to 
5 produce additional knockout animals. Knockout animals include, but are not 
limited to, mice, hamsters, sheep, pigs, cattle, and other non-human mammals. 
For example, a knockout mouse is produced. The resulting animals can serve 
as models of specific diseases, such as cancers, that exhibit under-expression of 
an MTSP25 polypeptide. Such knockout animals can be used as animal models 

10 of such diseases e.g., to screen for or test molecules for the ability to treat or 
prevent such diseases or disorders. 

Other types of transgenic animals also can be produced, including those 
that over-express the MTSP25 polypeptide. Such animals include "knock-in" 
animals that are animals in which the normal gene is replaced by a variant, such 

15 as a mutant, an over-expressed form, or other form. For example, one species', 
such as a rodent's endogenous gene can be replaced by the gene from another 
species, such as from a human. Animals also can be produced by non- 
homologous recombination into other sites in a chromosome; including animals 
that have a plurality of integration events. 

20 After production of the first generation transgenic animal, a chimeric 

animal can be bred to produce additional animals with over-expressed or mis- 
expressed MTSP25 polypeptides. Such animals include, but are not limited to, 
mice, hamsters, sheep, pigs, cattle and other non-human mammals. The 
resulting animals can serve as models of specific diseases, such as cancers, that 

25 exhibit over-expression or mis-expression of an MTSP25 polypeptide. Such 
animals can be used as animal models of such diseases e.g., to screen for or 
test molecules for the ability to treat or prevent such diseases. or disorders. In a 
specific embodiment, a mouse with over-expressed or mis-expressed MTSP25 
polypeptide is produced. 

30 

The following examples are included for illustrative purposes only and are 
not intended to limit the scope of the invention. 
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EXAMPLE 1 

Identification of MTSP25 

The protein sequence of the protease domain of enterokinase (accession 
number U09860) was used to search the human HTGS (High Throughput 
5 Genomic Sequence) database using the tblastn algorithm. This search and 
alignment algorithm compares a protein query sequence against a nucleotide 
sequence database dynamically translated in all six reading frames (both 
strands). A serine protease, referred to herein MTSP25 was identified. Based 
on the incomplete and unordered human genome sequences, MTSP25 appears 

10 to be localized on chromosome 12 (AC008121, AC009727 & AC013244). A 
search of sequences deposited in GenBank database showed that no identical 
sequence has been deposited. A search of sequences deposited in an EST 
database showed that two cDNA sequences (BG722131 and BG722203) 
derived from a human testis cDNA library match a segment of the MTSP25 

15 sequence (149 and 100 nucleotides, respectively). International PCT application 
No. WO 01/49864 describes a protein, which appears to be a zymogen, that 
includes a sequence of amino acids substantially identical to a full-length 
MTSP25 provided herein. This PCT applcation, however, does not provide 
numerous embodiments provided herein, such as single-chain and two-chain 

20 forms of a protease domain, or two-chain activated forms of the full-length 
polypeptide nor catalytically active fragments thereof, or uses of any of the 
polypeptides as provided herein. 

Cloning of cDNA Encoding the Protease Domain of MTSP25 

Using the nucleotide sequence of MTSP25 derived from the genomic 
25 sequence, two gene specific oligonucleotide primers were designed. The 
sequence for the 5'-end primer (Ch1 2-NSP1 -1 ) is 5'- 

GCACCGCTTAAGGATGTGTTGCAAGGGTC-3' SEQ ID No. 7 and that of the 3'- 
end primer (Ch1 2-NSP1-8AS) is 5'-ATGCTCTGTCAGCCACTTTTGGTAGAAGG- 
3' SEQ ID No. 8. A band of 747 bp was amplified from human testis Marathon- 
30 Ready cDNA (Clontech). Subsequent sequence analysis showed that the 
nucleotide sequence of this DNA fragment matched that of the genomic 
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MTSP25 exon sequences and encoded a portion of the MTSP25 protease 
domain, but did not include a stop codon. 

To obtain cDNA encoding the 3'-end of MTSP25 mRNA, 3'-RACE (rapid 
amplification of cDNA ends) reactions were performed on a Marathon-Ready 
5 cDNA library from human testis (Ciontech). Marathon-Ready cDNAs are 

specifically made for RACE reactions. The first RACE reactions were performed 
by PCR using GeneRacer cDNA 3'-RACE primer with gene specific primer, Ch12- 
NSP1-7, 5'-GGAAACACAAAGTGTTTTATAAGTGGCTGG-3' SEQ ID No. 9.. 
The resulting PCR products were purified after electrophoresis through an 
10 agarose gel. 

A second nested PCR was then performed using GeneRacer 3'RACE 
nested primer with gene specific primer (Ch1 2-NSP1-9) 5'- 

CATTTTGTGCAGGTGATGAAGATGGAGC-3' (SEQ ID No. 10; using the first 3'- 
RACE product as template). PCR products greater than 500 bp were purified 

15 after electrophoresis through an agrose gel. Purified PCR products were 

subcloned into pCR2,1-TOPO cloning vector (Invitrogen, San Diego). Colony 
hybridization was then performed to identify positive colonies containing 
MTSP25 cDNA. Positive clones were identified by colony hybridization using a 
747 bp DNA fragment containing MTSP25 protease domain sequence (obtained 

20 from PCR reaction with primers CM2-NSP1-1 (SEQ ID No. 7) and Ch12-NSP1- 
8AS (SEQ ID No. 8) and by DNA sequencing. Sequence analysis of the 3'-RACE 
products indicated that an additional 99 bp sequence including the stop codon 
was obtained. 

To obtain the remaining 5' upstream cDNA of MTSP25, 5'-RACE 
25 reactions were performed on the human testis RACE cDNA synthesized using 
FirstChoice® RLM-RACE kit (Ambion, Cat. No. L1500-O1, see, e.g., 
http://www.ambion.com). GeneRacer kit is specifically designed for full-length, 
RNA ligase-mediated rapid amplification of 5' and 3' cDNA ends (RLM-RACE). 
The first 5'-RACE reaction was performed by PCR using GeneRacer 5'-RACE 
30 primer with gene specific primers, ch1 2-NSP1-8AS, 5'- 

ATGCTCTGTCAGCCACTTTTGGTAGAAGG-3' (SEQ ID No. 8). The PCR 
products were purified from an agarose gel. A second nested PCR was then 
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performed using GeneRacer 5' RACE nested primer with gene specific primer 
CM2-NSP1-4AS, 5'-CACAGCTGTCCACATTAAAGGATCGCTAGC-3' (SEQ ID 
No. 17); using the first 5'-RACE product as template). The PCR products from 
RACE reactions, which were greater than 500 bp, were purified from agarose gel 
5 and subcloned into pCR2.1-TOPO cloning vector (Invitrogen). Colony 
hybridization was then performed to identify positive colonies containing 
MTSP25 sequence. An additional sequence of 273 bp was obtained from the 
second 5'-RACE products including an ATG start codon within the 
TCCAAAATGCGG SEQ ID No. 18 sequence. 

10. Gene expression profile of MTSP25 in normal and tumor tissues 

To obtain information regarding the gene expression profile of the 
MTSP25 transcript, a 369 bp DNA fragment containing MTSP25 protease 
domain sequence (obtained from PCR reaction with primers CM2-NSP1-1 (SEQ 
ID No. 7) and Ch1 2-NSP1-6AS, 5'- 

15 AGGCTGAATATAGTCATTATACCTCACTGC-3' (SEQ ID No. 11) was used to 
probe a dot blot composed of RNA extracted from 72 different human tissues 
(Human Multiple Tissue Expression (MTE) Array; Clontech, Palo Alto, CA; 
catalog no. 7776-1) as well as a dot blot composed of normalized cDNA from 
241 tumor and corresponding normal tissues from individual patients (Cancer 

20 Profiling Array, Clontech, catalog no. 7841-1). The results of MTE analysis 

indicate that MTSP25 transcript is expressed weakly in the lymph node. In the 
cancer profiling array analysis, MTSP25 is highly expressed in all 4 prostate 
samples (in normal and cancer samples). In one of the 20 kidney cDNA pairs, 
MTSP25 is highly expressed in the tumor sample, but not in its normal tissue 

25 counterpart. MTSP25 is also expressed in -1 of the 50 breast cancer samples, 
but not in its normal tissue counterpart. MTSP25 is also expressed in 3 of the 
42 normal uterus samples, but not in their tumor counterparts. MTSP25 
expression is also detected in 3 of the 14 ovarian cancer samples. Among these 
three samples, the expression of MTSP25 was also detected in one of the 

30 matched normal tissue counterparts. MTSP25 expression was also detected in 5 
tumor samples in the 34 colon cDNA pairs. 
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PCR analysis was also performed using a commercially available cDNA 
panel from several human adult tissues (Clontech, Cat. #K1 420-1 and K1 420-2) 
as well as several Marathon-Ready cDNAs (Clontech). The MTSP25 specific 
primers CM 2-NSP1-1 is 5'-GCACCGCTTAAGGATGTGTTGCAAGGGTC-3' SEQ 
5 ID No. 7 and Ch1 2-NSP1 -6AS (S'-AGGCTGAATATAGTCATTATACCTCACTGC- 
3' SEQ ID No. 1 1 were used in these reactions. MTSP25 cDNA was strongly 
detected in testis and mammary gland adenocarcinoma, weakly detected in 
brain, placenta, lung, spleen, prostate, small intestine, colon, and leukocyte, and 
very weakly detected in heart, liver, and pancreas. 

1 0 Domain structure of MTSP25 and homology to other proteases 

Sequence analysis of the translated nucleotide sequence of MTSP25 and 
protein domain prediction analysis revealed that MTSP25 contains a trypsin-like 
serine protease domain (residues 78-323 (or truncated variants (78-313, 314, 
315, 316 . . . 322, or shorter variants that are catalytically active, following 

15 cleavage)) characterized by the presence of a protease activation cleavage site 
(...R 77 i l 70 IGG..., where 4 indicates protease cleavage site) at the amino 
terminus of the domain and the catalytic triad residues (H 122 , D 17 , and S 268 ) in 
three highly-conserved regions. In addition, two other domains, a signal peptide 
sequence (residues 1-16 SEQ ID No. 16) and a C-terminal transmembrane 

20 domain (residues 324-346 SEQ ID No. 16), occur at the amino and carboxy 
termini, respectively, of MTSP25. There are 2 potential A/-glycosylation sites: 
N 219 AT and N 249 TS. The unpaired C 191 in the serine protease domain is predicted 
to pair with C 64 outside of the serine protease domain. Within the serine 
protease domain, the following cysteine pairings occur: C 107 -C l23 , C 206 -C 274r C 237 - 

25 C 253 , and C 2e4 -C 294 . One cysteine {C^o) within the C-terminal transmembrane 
domain is apparently unpaired. Alignment (see, e.g., b/astp; 
http://www.ncbi.nIm.nih.gov/BLAST) of MTSP25 protein sequence with that of 
enterokinase (GenBank accession number U09860) indicates that the two 
proteins share 41 % sequence identity in their protease domains. 

30 Sequence analysis 

MTSP25 cDNA and protein sequences were analyzed using MacVector 
(version 6.5.3; see, e.g., http://www.acceirys.com). The cDNA encoding 
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MTSP25 is 1364 bp long with an open reading frame that is 1047 bp in length. 
This ORF translates into a 348-amino acid protein. The cDNA sequence and the 
translated protein sequence of MTSP25 are as follows (see, also SEQ ID Nos. 15 
and 16, respectively). 
5 Sequence Range; 1 to 1364 

* 

10 20 30 40 50 60 

AAAGGGAAGTACCTGCCGCCATCTTGCTCACCAGCCTCCAAAATGCGGCTGGGGCTCCTG 
TTTC C CTT CATGGACGGCGGT AGAACGAGTGGTCG GAGGT TTTACG C CGACC C CGAGGAC 
10 M R L G L L 

70 80 90 100 110 120 

AGCGTGGCGCTGTTGTTTGTGGGGAGCTCTCACTTATACTCAGACCACTACTCGCCCTCT 
_ _ TCGCACCGCGACAACAAACACCCCTCGAGAGTGAATATGAGTCTGGTGATGAGCGGGAGA 
15 SVALLFVGSSHliYSDHYSPS 

130 140 150 160 170 180 

GGAAGGCACAGGCTCGGCCCCTCGCCGGAACCGGCGGCTAGTTCCCAGCAGGCTGAGGCC 
CCTTCCGTGTCCGAGCCGGGGAGCGGCCTTGGCCGCCGATCAAGGGTCGTCCGACTCCGG 
20 GRHRLGPS PEPAASSQQAEA 

190 200 210 220 230 240 

GTCCGCAAGAGGCTCCGGCGGCGGAGGGAGGGAGGGGCGCATGCAAAGGATTGTGGAACA 
CAGGCGTTCJTCCGAGGCCGCCGCCTCCCTCCCTCCCCGCGTACGTTTCCTAACACCTTGT 
25 VRKRLRRRREGGAHAKDCGT 

250 260 270 280 290 300 

GCACCGCTTAAGGATGTGTTGCAAGGGTCTCGGATTATAGGGGGCACCGAAGCACAAGCT 
CGTGGCGAATTCCTACACAACGTTCCCAGAGCCTAATATCCCCCGTGGCTTCGTGTTCGA 
30 APLiKDVLQG'SRI IGGTB AQA 

310 320 330 340 350 360 

GGCGCATGGCCGTGGGTGGTGAGCCTGCAGATTAAATATGGCCGTGTTCTTGTTCATGTA 
CCGCGTACCGGCACCCACCACTCGGACGTCTAATTTATACCGGCACAAGAACAAGTACAT 
35 GAWPWVVSLQIKYGRVLVHV 

370 380 390 400 410 420 

TGTGGGGGAACCCTAGTGAGAGAGAGGTGGGTCCTCACAGCTGCCCACTGCACTAAAGAC 
ACACCCCCTTGGGATCACTCTCTCTCCACCCAGGAGTGTCGACGGGTGACGTGATTTCTG 
40 CGGTLVRERWVLTAAHCTKD 

• 

430 440 450 460 470 480 

G CT AGCGAT CCTTTAATGTGGACAGCT GTGATTGGAACTAAT AATAT ACATG GACGCTAT 
CGATCGCTAGGAAATTACACCTGTCGACACTAACCTTGATTATTATATGTACCTGCGATA 
45 ASDPLMWTAVIGTNNIHGRY 

490 500 510 520 530 540 

CCTCATACCAAGAAGATAAAAATTAAAGCAATCATTATTCATCCAAACTTCATTTTGGAA 
GGAGT ATGGTTC T TCTATTT TTAATTTCGTTAGT AAT AAGTAGGTTT GAAGTAAAACOTT 
50 PHTKKIKIKAIIIHPNFILE 

550 560 570 580 590 600 

TCTTATGTAAATGATATTGCACTTTTTCACTTAAAAAAAGCAGTGAGGTATAATGACTAT 
AGAATACATTTACTATAACGTGAAAAAGTGAATTTTTTT CGT CACT C CAT ATT ACTGAT A 
55 SYVNDIALFHLKKAVRYNDY 

610 620 630 640 650 660 

ATTCAGCCTATTTGCCTACCTTTTGATGTTTTCCAAATCCTGGACGGAAACACAAAGTGT 
T7VA.GT CGGATAAACGGATGGAAAACT ACAAAAGGTTTAGGAC CTG C CTTTGTGTT T CACA 
60 IQPICXjPFDVFQILDGNTKC 
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10 



15 



20 



25 



30 



35 



40 



670 680 690 700 710 720 

TTTATAAGTGGCTGGGGAAGAACAAAAGAAGAAGGTAACGCTACAAATATTTTACAAGAT 
AAATATTCACCGACCCCTTCTTGTTTTCTTCTTCCATTGCGATGTTTATAAAATGTTCTA 
F I SGWGRTKEEGNATNILQD 

730 740 750 760 770 780 

GCAGAAGTGC ATT ATATT T CTCGAGAGATGT GTAATTC TGAGAGGAGTTATGGGGGAATA 
CGT CTT CACGTAAT ATAAAGAG CT CT CT AC ACATTAAGACTCT C CT C AATAC CCC CTTAT 
AEVHY ISREMCNSERSYGGI 

790 800 810 820 830 840 

ATTCCTAACACTTCATTTTGTGCAGGTGATGAAGATGGAGCTTTTGATACTTGCAGGGGT 
TAAGGATTGTGAAGTAAAACACGTCCACTACTTCTACCTCGAAAACTATGAACGTCCCCA 
I PNTS FCAGDEDG AFDT CRG 

850 860 870 880 890 900 

GACAGTGGGGGACCATTAATGTGCTACTTACCAGAATATAAAAGATTTTTTGTAATGGGA 
CT GTCACCC CCTGGT AATTACACGATGAATGGTC TTAT ATTTTCTAAAAAAC ATTAC C CT 
DSGGPL .MCY LPEYKRFFVMG 

910 920 930 940 950 960 

ATTACCAGTTACGGACATGGCTGTGGTCGAAGAGGTTTTCCTGGTGTCTATATTGGGCCA 
TAATGGTCAATGCCTGTACCGACACCAGCTTCTCCAAAAGGACCACAGATATAACCCGGT 
I TSYGHGCGRRGFP GVYIGP 

970 980 990 1000 1010 1020 

T C CT T CTAC CAAAAGTGGCTGACAGAGC ATTT C TTC C ATGC AAGCACT CAAGG C ATACTT 
AGGAAGATGGTTTTCACCGACTGTCTCGTAAAGAAGGTACGTTCGTGAGTTCCGTATGAA 
SFYQKWLTEHFFHASTQGIL 

1030 1040 1050 1060 1070 1080 

ACTATAAATATTTTACGTGGC CAGATC C TCAT AGCTTTATGTTT TGT CATCT TACTAGCA 
TGATATTTATAAAATGCAC CGGT CTAGGAGTAT CGAAAT ACAAAACAGT AGAATGAT CGT 
TINILRGQILIALCFVILLA 

1090 1100 1110 1120 1130 1140 

ACAACAT AAAGAAAT TCTGAAGGCTTT CATAT CT TTATT TTGCATTGTGT C C CTTTCT AT 

TGTT GTATTT CTTTAAGACTTC CGAAAGTAT AGAAAT AAAACGT AAC ACAGGGAAAGATA 
T T * 

1150 1160 1170 1180 1190 1200 

GTT CTATATAATGAAC AT C ATTTATT C TTCT AGCAATTAATTGC CTACATTAGAGATTT C 
CAAGATAT ATTACTT GTAGTAAATAAGAAGAT CGTT AATTAACGGATGTAAT CT CT AAAG 



45 1210 1220 1230 1240 1250 1260 

ATGTGAACATTTTATGGGCTATAAGTATTGTGACAGATATACAATTGTAATTTTGGCACT 
TAC ACTTGTAAAATACC C GATATTC AT AAC ACTGTCTAT ATGTT AAC ATTAAAAC CGTGA 

1270 1280 1290 1300 1310 1320 

5 O GAATCACATGTCTCCTTGAAATATCTTGATTATTTTATAATCATAATTCTGTATCTGGAA 

C TT AGTGTAC AGAGGAACTTTATAGAACTAATAAAATATT AGT ATTAAGAC ATAGAC CTT 

1330 1340 1350 1360 

TACTCATAGAGTTTGTACAAAATATTCAGTTAAACATATATTTA 
5 5 ATGAGTATCTCAAACATGTTTTATAAGTCAATTTGTATATAAAT 

EXAMPLE 2 
Expression of the protease MTSP domains 

Nucleic acid encoding each a full length MTSP25 and/or protease domain 
thereof can be cloned into a derivative of the Pichia pastoris vector pPIC9K 
5 (available from invitrogen; see SEQ ID NO. 13), called pPCI9K. Plasmid pP!C9K 
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features include the 5' A0X1 promoter fragment at 1-948; 5' AOX1 primer site 
at 855-875; alpha-factor secretion signal(s) at 949-1218; alpha-factor primer 
site at 1 1 52-1 1 72; multiple cloning site at 1 1 92-1 241 ; 3' AOX1 primer site at 
1327-1347; 3' AOX1 transcription termination region at 1253-1586; HIS4 ORF 
5 at 4514-1980; kanamycin resistance gene at 5743-4928; 3' AOX1 fragment at 
6122-6879; ColE1 origin at 7961-7288; and the ampicillin resistance gene at 
8966-8106. The plasmid is derived from pPIC9K by eliminating the Xhol site in 
the kanamycin resistance gene and the resulting vector is designated pPIC9KX. 
Other vectors that can be used include insect and mammalian vectors as 
10 described, for example, above. The protein also can be expressed in E. coif in, 
for example, incusion bodies in the cytoplasm or in the cytoplasm using the 
strain Origami (i.e., Origami B from Novagen, Madison Wl) permit folding in the 
cytoplasm, and also can be expresed in the periplasmic space. 

EXAMPLE 3 

1 5 Assays for identification of candidate compounds that modulate that activity of 
an MTSP 

Assay for screening MTSP25 Inhibitors 

The protease domain of MTSP25 expressed in Pichia pastoris is assayed 
for inhibition by various compounds as follows in Costar 96 well tissue culture 

20 plates (Corning NY). Approximately 1-10 nM MTSP25 is added without 

inhibitor, or with 100000 nM inhibitor and 7 1 :6 dilutions to 1X direct buffer 
(29.2 mM Tris, pH 8.4, 29.2 mM Imidazole, 217 mM NaCI (100 fjL final 
volume)), and allowed to incubate at room temperature for 30 minutes. 400 jjM 
substrate Spectrozyme t-PA (American Diagnostica,, Greenwich, CT) is added 

25 and reaction is monitored in a SpectraMAX Plus microplate reader (Molecular 
Devices, Sunnyvale CA) by following change in absorbance at 405 nm for 20 
minutes at 37 °C. Spectrozyme UK can also be used as the substrate in this 
assay. 

Identification of substrates 

30 Other substrates for use in the assays can be identified empirically by 

testing substrates. The following list of substrates are exemplary of those that 
can be tested. 
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| Substrate name 


Structure 


S 2366 — 


pyrooiu-rro-Mrg-piM/\. Mtl 


(spectrozyme t-PA 


CH 3 S0 2 -D-HHT-Gly-Arg-pNA.AcOH 


i\i-p-tosyl-Gly-Pro-Arg-pNA 


N-p-tosyl-Gly-Pro-Arg-pNA j 


Denzoyl-val-Gly-Arg-pNA 


tJenzoyl-Val-Gly-Arg-pNA j 


refachrome t-PA 

1 C O *7 C 


CH 3 S0 2 -D-HHT-Gly-Arg-pNA 


|o Z/bb 

loo/iyiyi 


N-a-Z-D-Arg-Gly-Arg-pNA.2HCI | 




pyroGlu-Gly-Arg-pNA.HCI | 




H-D-l!e-Pro-Arg-pNA.2HCI j 


spectrozyme UK 


Cbo-L-(K)Glu{a-t-BuO)-Gly-Arg-pNA.2AcOH 




H-D-Pro-Phe-Arg-pNA . 2H CI j 


O ZZDD 


H-D-Val-Leu-Arg-pNA.2HCI | 


1 c o o o o 


Bz-lle-Glu(g-OR)-Gly-Arg-pNA.HCI 
n — nloU/b) and n = CH 3 (50%) 


win ui i iuz.y j 1 1 1? i IV 


Denzoyi-rro-r'ne-Arg-pNA 


|S 2238 


H-D-Phe-Pip-Arg-pNA.2HCI | 


S 2251 


H-D-Val-Leu-Lys-pNA.2HCI | 


Spectrozyme PI 


H-D-Nle-HHT-Lys-pNA.2AcOH | 




Pyr-Arg-Thr-Lys-Arg-AMC | 




H-Arg-Gln-Arg-Arg-AMC | 




BoG-Gln-Gly-Arg-AMC | 




Z-Arg-Arg-AMC | 


Spectrozyme THE 


H-D-HHT-Ala-Arg-pNA.2AcOH j 


Spectrozyme fXlla 


H-D-CHT-Gly-Arg-pNA.2AcOH | 




CVS 2081-6 (MeS0 2 -dPhe-Pro-Arg-pNA) | 


A _ ~ /_u-. 


Pefachrome fVUa (CH 3 S0 2 -D-CHA-But-Arg-pNA) | 



30 



AMC = amino methyl coumarin (fluorescent) 

If none of the above substrates are cleaved, a coupled assay, can be 
used. Briefly, a coupled assay tests the ability of the protease to activate an 
enzyme, such as plasminogen and trypsinogen. To perform these assays, the 
single chain protease is incubated with a zymogen, such as plasminogen or 
trypsinogen, in the presence of a known substrate, such as Spectrozyme PI, for 
the zymogen. If the single chain activates the zymogen, the activated enzyme, 
such as plasmin and trypsin, will degrade the substrate therefor. 
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EXAMPLE 4 

Other Assays 

These assays are described with reference to MTSP1, but such assays 
can be readily adapted for use with MTSP25. 

5 Amidolytic Assay for Determining Inhibition of Serine Protease 

Activity of Matriptase or MTSP1 

The ability of test compounds to act as inhibitors of rMAP catalytic 
activity is assessed by determining the inhibitor-induced inhibition of amidolytic 
activity by the MAP, as measured by IC 50 values. The assay buffer is HBSA (10 
10 mM Hepes, 1 50mM sodium chloride, pH 7.4, 0.1 % bovine serum albumin). All 
reagents were from Sigma Chemical Co. {St. Louis, MO), unless otherwise 
indicated. 

Two IC so assays (a) one at either 30-minutes or 60-minutes (a 30-minute 
or a 60-minute preincubation of test compound and enzyme) and (b) one at 

15 0-minutes (no preincubation of test compound and enzyme) were conducted. 
For the IC S0 assay at either 30-minutes or 60-minutes, the following reagents 
were combined in appropriate wells of a Corning nriicrotiter plate: 50 microliters 
of HBSA, 50 microliters of the test compound, diluted (covering a broad 
concentration range) in HBSA (or HBSA alone for uninhibited velocity 

20 measurement), and 50 microliters of the rMAP (Corvas International) diluted in 
buffer, yielding a final enzyme concentration of 250 pM as determined by active 
site filtration. Following either a 30-minute or a 60-minute incubation at ambient 
temperature, the assay is initiated by the addition of 50 microliters of the 
substrate S-2765 (N-a-Benzyloxycarbonyl-D-arginyl-L-glycyl-L-arginine-p- 

25 nitroaniline dihydrochloride; DiaPharma Group, Inc.; Franklin, OH) to each well, 
yielding a final assay volume of 200 microliters and a final substrate 
concentration of 100/yM (about 4-times KJ. Before addition to the assay 
mixture, S-2765 is reconstituted in deionized water and diluted in HBSA. For the 
IC 50 assay at 0 minutes; the same reagents were combined: 50 microliters of 

30 HBSA, 50 microliters of the test compound, diluted (covering the identical 
concentration range) in HBSA (or HBSA alone for uninhibited velocity 
measurement), and 50 microliters of the substrate S-2765. The assay is 
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initiated by the addition of 50 microliters of rMAP. The final concentrations of 
all components were identical in both IC S0 assays (at 30- or 60- and 0-minute). 

The initial velocity of chromogenic substrate hydrolysis is measured in 
both assays by the change of absorbance at 405 nm using a Thermo Max® 
5 Kinetic Microplate Reader (Molecular Devices) over a 5 minute period, in which 
less than 5% of the added substrate is used. The concentration of added 
inhibitor, which caused a 50% decrease in the initial rate of hydrolysis is defined 
as the respective IC 50 value in each of the two assays (30- or 60-minutes and 
0-minute). 

10 In vitro enzyme assays for specificity determination 

The ability of compounds to act as a selective inhibitor of matriptase 
activity was assessed by determining the concentration of test compound that 
inhibits the activity of matriptase by 50%, (IC S0 ) as described in the above 
Example, and comparing IC 50 value for matriptase to that determined for all or 
15 some of the following serine proteases: thrombin, recombinant tissue 

plasminogen activator (rt-PA), plasmin, activated protein C, chymotrypsin and 
factor Xa. The buffer used for all assays was HBSA (10 mM HEPES, pH 7.5, 
150 mM sodium chloride, 0.1% bovine serum albumin)! 

The assay for IC 50 determinations was conducted by combining in 
20 appropriate wells of a Corning microtiter plate, 50 microliters of HBSA, 50 

microliters of the test compound at a specified concentration (covering a broad 
concentration range) diluted in HBSA (or HBSA alone for V 0 (uninhibited velocity) 
measurement), and 50 microliters of the enzyme diluted in HBSA. Following a 
30 minute incubation at ambient temperature, 50 microliters of the substrate at 
25 the concentrations specified below' were added to the wells, yielding a final total 
volume of 200 microliters. The initial velocity of chromogenic substrate 
hydrolysis was measured by the change in absorbance at 405 nm using a 
Thermo Max® Kinetic Microplate Reader over a 5 minute period in which less 
than 5% of the added substrate was used. The concentration of added inhibitor 
30 which caused a 50% decrease in the initial rate of hydrolysis was defined as the 
IC 50 value. 
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Thrombin (flla) Assay 

Enzyme activity was determined using the chromogenic substrate, 
Pefachrome t-PA (CH 3 S0 2 -D-hexahydrotyrosine-glycyl-L-Arginine-p-nitroaniIine / 
obtained from Pentapharm Ltd.). The substrate was reconstituted in deionized 
5 water prior to use. Purified human a-thrombin was obtained from Enzyme 

Research Laboratories, Inc. The buffer used for all assays was HBSA (10 mM 
HEPES, pH 7.5, 150 mM sodium chloride, 0.1% bovine serum albumin). 

IC 50 determinations were conducted where HBSA (50 pL), a-thrombin (50 
//I) (the final enzyme concentration is 0.5 nM) and inhibitor (50 ju\) (covering a 
10 broad concentration range), were combined in appropriate wells and incubated 
for 30 minutes at room temperature prior to the addition of substrate 
Pefachrome-t-PA (50 (the final substrate concentration is 250 >uM, about 5 
times Km). The initial velocity of Pefachrome t-PA hydrolysis was measured by 
the change in absorbance at 405 nm using a Thermo Max® Kinetic Microplate 
15 Reader over a 5 minute period in which less than 5% of the added substrate was 
used. The concentration of added inhibitor which caused a 50% decrease in the 
initial rate of hydrolysis was defined as the IC 50 value. 

Factor Xa 

Factor Xa catalytic activity was determined using the chromogenic 
20 substrate S-2765 (N-benzyloxycarbonyl-D-arginine-L-glycine-L-arginine-p-nitro- 
aniline), obtained from DiaPharma Group (Franklin, OH). All substrates were 
reconstituted in deionized water prior to use. The final concentration of S-2765 
was 250 //M (about 5-times Km). Purified human Factor X was obtained from 
Enzyme Research Laboratories, Inc. (South Bend, IN) and Factor Xa (FXa) was 
25 activated and prepared from it as described [Bock, P.E., Craig, P.A., Olson, S.T., 
and Singh, P. Arch. Btochem. Biophys. 273:375-388 (1989)]. The enzyme was 
diluted into HBSA prior to assay in which the final concentration was 0.25 nM. 
Recombinant tissue plasminogen activator (rt-PA) Assay 

rt-PA catalytic activity was determined using the substrate, Pefachrome 
30 t-PA (CH 3 S0 2 -D-hexahydrotyrosine-glycyl-L-arginine-p-nitroaniline, obtained from 
Pentapharm Ltd.). The substrate was made up in deionized water followed by 
dilution in HBSA prior to the assay in which the finai concentration was 500 
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micromolar (about 3-times Km). Human rt-PA (Activase®) was obtained from 
Genentech Inc. The enzyme was reconstituted in deionized water and diluted 
into HBSA prior to the assay in which the final concentration was 1.0 nM. 

Plasmin Assay 

5 Plasmin catalytic activity was determined using the chromogenic 

substrate, S-2366 (L-pyroglutamyI-L-pro!y1-L-arginine-p-nitroaniline 
hydrochloride), which was obtained from DiaPharma group. The substrate was 
made up in deionized water followed by dilution in HBSA prior to the assay in 
which the final concentration was 300 micromolar (about 2.5-times Km). 

10 Purified human plasmin was obtained from Enzyme Research Laboratories, Inc. 
The enzyme was diluted into HBSA prior to assay in which the final 
concentration was 1.0 nM. 

Activated Protein C (aPC) Assay 
aPC catalytic activity was determined using the chromogenic substrate, 

15 Pefachrome PC (delta-carbobenzloxy-D-lysine-L-prolyl-L-arginine-p-nitroaniline 

dihydrochloride), obtained from Pentapharm Ltd.). The substrate was made up 
in deionized water followed by dilution in HBSA prior to the assay in which the 
final concentration was 400 micromolar (about 3-times Km). Purified human 
aPC was obtained from Hematologic Technologies, Inc. The enzyme was diluted 

20 into HBSA prior to assay in which the final concentration was 1.0 nM. 

Chymotrypsin Assay 
Chymotrypsin catalytic activity was determined using the chromogenic 
substrate, S-2586 (methoxy-succinyl-L-arginine-L-proIyl-L-tyrosyl-p-nitroanilide), 
which was obtained from DiaPharma Group. The substrate was made up in 

25 deionized water followed by dilution in HBSA prior to the assay in which the final 
concentration was 100 micromolar (about 9-times Km). Purified (3X-crystal!ized; 
CDI) bovine pancreatic alpha-chymotrypsin was obtained from Worthington 
Biochemical Corp. The enzyme was reconstituted in deionized water and diluted 
into HBSA prior to assay in which the final concentration was 0.5 nM. 
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Since modifications will be apparent to those of skill in this art, it is 
intended that this invention be limited only by the scope of the appended claims. 
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WHAT IS CLAIMED IS: 

1 . A substantially purified single or two chain polypeptide, consisting 
essentially of protease domain of a type-l membrane-type serine protease 25 
(MTSP25) or a catalytically active portion thereof; and optionally additional 

5 sequences of amino acids, wherein the only MTPS25 sequences of amino acids 
are the protease domain portion. 

2. A substantially purified polypeptide selected from the group 
consisting of: 

a) a single chain polypeptide that consists essentially of sequence 
10 of amino acids set forth as residues 1-237 in SEQ ID No. 6, residues 78-323 in 

SEQ ID No. 1 6 or catalytically active portions thereof; and optionally additional 
sequences of amino acids from other than an MTSP25 polypeptide; 

b) an activated two chain polypeptide that comprises the 
sequence of amino acids set forth as residues 1-237 in SEQ ID No. 6 or the 

15 sequence of amino acids set forth as residues 17-313 or residues 78-313 in SEQ 
ID No. 1 6 or catalytically active portions thereof; 

c) an activated two chain or single chain polypeptide that 
consists essentially of residues 17-237 or 1-237 of SEQ ID No. 6 or residues 78- 
323 or C-terminal truncated variants thereof in SEQ ID No. 16 or residues 17- 

20 323 of SEQ ID No. 1 6, or catalytically active portions of any of these 

polypeptides; and optionally additional sequences of amino acids from other than 
an MTPS25 polypeptide; 

d) an activated two chain polypeptide that comprises the sequence 
of amino acids set forth as residues 64-191 of SEQ ID No. 16 or catalytically 

25 active portions thereof; 

e) a polypeptide that includes a sequence of amino acids having at 
least about 60%, 70%, 75%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 
88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% 
sequence identity with the sequence of amino acids of the MTSP25 portion of 

30 any of the polypeptides of a)-d); 

f) a protease domain of a polypeptide encoded by a splice variant 
of a sequence of nucleotides that encodes an MTSP25 of any of a)-e); and 
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g) a polypeptide encoded by a sequence of nucleotides that 
hybridizes under conditions of low, moderate or high stringency to the sequence 
of nucleotides encoding the MTPS25 portion of the polypeptides of any of a)-f) 
where the sequence of nucleotides hybridizes along at least about 70%, 80% or 
5 90% of its full-length under such conditions. 

3. The polypeptide of any of claims 1 or 2 that is an activated two 
chain polypeptide. 

4. The polypeptide of claim 1 or claim 2, wherein: 

the MTSP25 portion of the polypeptide consists essentially of the 
10 protease domain of an MTSP25 or a catalytically active portion thereof. 

5. The polypeptide of any of claims 1-4, wherein the MTSP25 is a 
human polypeptide. 

6. The substantially purified polypeptide of claim 1 or claim 2 that 
has at least 60%, 70%, 80% or 90% sequence identity with a polypeptide that 

15 comprises the sequence of amino acids set forth as residues 1-237 in SEQ ID 
No. 6, wherein the substantially purified polypeptide is a protease. 

7. A polypeptide of any of claims 1-6, wherein the protease domain 
portion is encoded by a nucleic acid molecule that hybridizes under conditions of 
high stringency along at least 70% of its full-length to a nucleic acid molecule 

20 comprising a sequence of nucleotides that encodes residues 17-237 in SEQ ID 
No. 5. 

8. A polypeptide of any of claims 1-7 that is a mutein, wherein: 
up to about 50% of the amino acids are replaced with another amino 

acid; 

25 and the resulting polypeptide is a single chain or two chain polypeptide 

that has catalytic activity of at least 1 % of the unmutated polypeptide. 

9. A polypeptide of any of claims 1-7 that is a mutein, wherein: 
up to about 50% of the amino acids are replaced with another amino 

acid; and 

30 the resulting polypeptide is a single chain or two chain polypeptide that 

has catalytic activity of at least 5% of the unmutated polypeptide. 
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10. The polypeptide of claim 8 or claim 9, wherein up to about 10% 
of the amino acids are replaced with another amino acid. 

1 1 . The polypeptide of claim 8 or claim 9, wherein the resulting 
polypeptide is a single chain or two chain polypeptide and has catalytic activity 

5 of at least 10% of the unmutated polypeptide, 

12. The polypeptide of claim 8 or claim 9, wherein: 

a cysteine in the protease domain is replaced with another amino acid; 
and the cysteine is one that is unpaired in a polypeptide that consists essentially 
of the protease domain. 
10 13. The polypeptide of claim 12, wherein the replacing amino acid is a 

serine. 

14. A polypeptide that consists essentially of amino acids 1-17 of SEQ 
ID No. 6 optionally operatively linked as a signal polypeptide to a polypeptide 
other than an MTSP25 polypeptide. 
15 15. A nucleic acid molecule, comprising a sequence of nucleotides that 

encodes a polypeptide of any of claims 1-14. 

16. The nucleic acid molecule of claim 15 that comprises a sequence of 
nucleotides selected from the group consisting of: 

(a) a sequence of nucleotides set forth as nucleotides 1-711 in SEQ ID 
20 No. 5 or set forth in SEQ ID No. 1 5 or a portion thereof that encodes a 

proteolytically active polypeptide; 

(b) a sequence of nucleotides that hybridizes under high stringency along 
its length or along at least about 70% of its full-length to the sequence of 
nucleotides set forth nucleotides 1-71 1 in SEQ ID No. 5; 

25 (c) a sequence of nucleotides that encodes a single chain or activated 

two chain protease domain that comprises a sequence of nucleotides that 
encodes residues 17-323 or 78-323 or catalytically active portions thereof of 
SEQ ID No. 16; 

(d) a sequence of nucleotides that is a splice variant of (a), (b) or (c); 
30 (e) a sequence of nucleotides that encodes the protease domain or a 

catalytically active portion thereof that includes a sequence of 
nucleotides having at least about 60%, 70%, 80%, 90% or 95% 
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sequence identity with the sequence set forth in as nucleotides 1- 
71 1 in SEQ ID No. 5; and 

(f) a sequence of nucleotides comprising degenerate codons of (a), 
(b), (c), (d) or (e). 

5 17. An isolated nucleic molecule that encodes a mutein polypeptide of 

claim 8. 

18. A vector comprising the nucleic acid molecule of any of claims 

15-17. 

1 9. The vector of claim 18 that is an expression vector. 
1Q 20. The vector of claim 19 that is a eukaryotic vector. 

21 . The vector of claim 19 or claim 20 that includes a sequence of 
nucleotides that directs secretion of any polypeptide encoded by a sequence of 
nucleotides operatively linked thereto. 

22. The vector of claim 21 that is a Pichia vector, a mammalian 
15 expression vector, an insect expression vector or an E. coff vector. 

23. A cell, comprising the vector of any of claims 18-22. 

24. The cell of claim 23 that is a prokaryotic cell. 

25. The cell of claim 23 that is a eukaryotic cell. 

26. The cell of claim 23 that is selected from among a bacterial cell, a 
20 yeast cell, a plant cell, an insect cell and an animal cell. 

27. The cell of claim 23 that is a mammalian cell. 

28. A recombinant non-human animal, wherein an endogenous gene 
that encodes a polypeptide of any of claims 1-15 has been deleted or inactivated 
by homologous recombination or insertional mutagenesis of the endogenous 

25 gene in the animal or an ancestor thereof. 

29. A method for producing a polypeptide that contains a protease 
domain of an MTSP25 polypeptide, comprising: 

culturing the cell of any of claims 23-27 under conditions whereby the 
encoded polypeptide is expressed by the cell; and 
30 recovering the expressed polypeptide. 

30. The method of claim 29, wherein the polypeptide is secreted into 
the culture medium. 
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31 . The method of claim 29, wherein the cell is a Pichia cell. 

32. The method of claim 29, wherein the polypeptide is expressed in 
the cytoplasm of the host cell. 

33. A double-stranded RNA (dsRNA) molecule that comprises at least 
5 about 21 contiguous nucleotides or modified nucleotides from a sequence of 

nucleotides encoding an MTSP25 of any of claims 1-14. 

34. An antibody that specifically binds to the single chain form of a 
protease domain of the polypeptide of any of claims 1-14 and/or to the activated 
two chain form of the polypeptide of any of claims 1-14 or of an MTSP25 of 

10 SEQ ID No. 16, or a fragment or derivative of the antibody containing a binding 
domain thereof, wherein the antibody is a polyclonal antibody or a monoclonal 
antibody, wherein the antibody binds to the polypeptide with at least two-fold 
greater affinity than to a full-length zymogen form of an MTSP25. 



35. The antibody of claim 34 that inhibits the enzymatic activity of the 



15 



polypeptide. 



36. 



A conjugate, comprising: 

a polypeptide of any of claims 1-14 or an MTSP25 of SEQ ID No. 



16, and 



a targeting agent linked to the polypeptide directly or via a linker. 



20 



37. The conjugate of claim 36, wherein the targeting agent permits 



affinity isolation or purification of the conjugate; 
attachment of the conjugate to a surface; 
detection of the conjugate; or 
targeted delivery to a selected tissue or cell. 



25 



38. A combination, comprising: 



an agent or treatment that inhibits the catalytic activity of the 



polypeptide of any of claims 1-14 or of an MTSP25; and 



another treatment or agent selected from anti-tumor and anti- 



30 



angiogenic treatments and agents. 

39. The combination of claim 38, wherein the inhibitor and the anti- 
tumor and/or anti-angiogenic agent are formulated in a single pharmaceutical 
composition or each is formulated in separate pharmaceutical compositions. 
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40. The combination of claim 38, wherein the inhibitor is selected from 
antibodies and antisense oligonucleotides and double-stranded RNA (dsRNA). 

41. A solid support, comprising one or more polypeptides of any of 
claims 1-14 linked thereto either directly or via a linker. 

5 42. The support of claim 41, wherein the polypeptides comprise an 

array. 

43. The support of claim 41, wherein the polypeptides comprise a 
plurality of different protease domains. 

44. A solid support comprising one or more nucleic acid molecules of 
10 any of claims 15-17 or oligonucleotide portions thereof linked thereto either 

directly or via a linker, wherein the oligonucleotides contain at least 1 6 
nucleotides. 

45. The support of claim 44, wherein the nucleic acid molecules 
comprise an array. 

15 46. The support of claim 44 or claim 45, wherein the nucleic acid 

molecules comprise a plurality of molecules that encode different protease 
domains. 

47. A method for identifying compounds that modulate the protease 
activity of a polypeptide, comprising: 

20 contacting a polypeptide of any of claims 1-14 with a substrate that is 

proteolytically cleaved by the polypeptide, and, either simultaneously, before or 
after, adding a test compound or plurality thereof; 

measuring the amount of substrate cleaved in the presence of the test 
compound; and 

25 selecting compounds that change the amount of substrate cleaved 

compared to a control, whereby compounds that modulate the activity of the 
polypeptide are identified. 

48. The method of claim 47, wherein the test compounds are small 
molecules, peptides, peptidomimetics, natural products, antibodies or fragments 

30 thereof that modulate the activity of the polypeptide. 

49. The method of claim 47 or claim 48, wherein a plurality of the test 
substances are screened simultaneously. 
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50. The method of any of claims 47-49, wherein the polypeptide is a 
single chain polypeptide that consists essentially of a polypeptide encoded by a 
sequence of nucleotides selected from the group consisting of a sequence of 
nucleotides that includes: 

5 (a) a sequence of nucleotides encoding amino acids 17-323, 64-323, 78- 

323 in SEQ ID No. SEQ ID No. 1 6 or a catalytically active fragment thereof; 

(b) a sequence of nucleotides that hybridizes under high stringency along 
its length or along at least about 70% of its full-length to a sequence of 
nucleotides that encodes amino acids 17-323, 64-323 or 64-323 in SEQ ID No. 

i 

10 SEQ ID No. 1 6 or a catalytically active fragment thereof; 

(c) a sequence of nucleotides that encodes the polypeptide of residues 1- 
237 SEQ ID No. 6 or residues 17-323 of SEQ ID No. 16; 

(d) a sequence of nucleotides that Is a splice variant of (a), (b) or (c); 

(e) a sequence of nucleotides that includes a sequence of nucleotides 
15 having at least about 60%, 70%, 80%, 90% or 95% sequence 

identity with the sequence of nucleotides of any of (a)-(d) 

(f) a sequence of nucleotides comprising degenerate codons of (a), (b), 
(c), (d) or (e). 

51 . The method of any of claims 47-49, wherein the polypeptide 

20 comprises a polypeptide that consists essentially of a polypeptide selected from 

the group consisting of: 

a) a polypeptide that consists essentially of amino acids 1-237 set 

in SEQ ID No. 6 or set forth as amino acids 78-314, 17-314, 78-346 or ,17-346 in 

SEQ ID No. 1 5 or catalytically active fragments thereof; 
25 b) a two-chain polypeptide comprising the sequence of amino acids 

set forth in SEQ ID No. 16; 

c) polypeptide that consists essentially of a sequence of amino 

acids encoded by a sequence of nucleotides that hybridizes under conditions of 

high stringency to along at least 70% of its full-length to a sequence of 
30 nucleotides in encoding a polypeptide of a) or b); 
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d) a polypeptide that comprises a sequence of amino acids having at 
least about 60% sequence identity with the sequence of amino acids of a 
polypeptide of a), b) or c); and 

e) a polypeptide encoded by a sequence of nucleotides that is a 

5 splice variant of a sequence of nucleotides that encodes any of the polypeptides 
of a}-d). 

52. The method of any of claims 47-51, wherein the change in the 
amount of substrate cleaved is assessed by comparing the amount of substrate 
cleaved in the presence of the test compound with the amount of substrate 

10 cleaved in the absence of the test compound. 

53. The method of any of claims 47-52, wherein a plurality of the 
polypeptides are linked to a solid support, either directly or via a linker. 

54. The method of claim 53, wherein the polypeptides comprise an 

array. 

15 55. A method of identifying a compound that specifically binds to a 

single-chain and/or two-chain protease domain and/or to single or two-chain 
polypeptide and/or to a proteolytically active portion of the single or two chain 
form thereof of an MTSP25 polypeptide, comprising: 

contacting an MTSP25 polypeptide of any of claims 1-14 or a 

20 proteolytically active portion thereof with a test compound or plurality 

thereof under conditions conducive to binding thereof; and either: 

a) identifying test compounds that specifically bind to the single 
chain and/or two chain form of the polypeptide or to a proteolytically active 
portion of the single and/or two chain form thereof, or 

25 b) identifying test compounds that inhibit binding of a compound 

known to bind a single chain and/or two chain form of the polypeptide or to a 
proteolytically active portion of the single and/or two chain form thereof, 
wherein the known compound is contacted with the polypeptide before, simul- 
taneously with or after the test compound. 

30 56. The method of claim 55, wherein the polypeptide is linked either 

directly or indirectly via a linker to a solid support. 
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57. The method of claim 55 or claim 56, wherein the test compounds 
are small molecules, peptides, peptidomimetics, natural products, antibodies or 
fragments thereof. 

58. The method of any of claims 55-57, wherein a plurality of the test 
5 substances are screened simultaneously. 

59. The method of any of claims 55-58, wherein a plurality of the 
polypeptides are linked to a solid support. 

60. The method of any of claims 55-59, wherein the polypeptide 
comprises a polypeptide that consists essentially of a polypeptide selected from 

10 the group consisting of: 

a) a polypeptide that consists essentially of amino acids 1-237 set 
in SEQ ID No. 6 or set forth as amino acids 78-314, 17-314, 78-346 or 17-346 
in SEQ ID No. 1 5 or catalytically active fragments thereof; 

b) a two-chain polypeptide comprising the sequence of amino 
15 acids set forth in SEQ ID No. 16; 

c) polypeptide that consists essentially of a sequence of amino 
acids encoded by a sequence of nucleotides that hybridizes under conditions of 
high stringency to along at least 70% of its full-length to a sequence of 
nucleotides in encoding a polypeptide of a) or b); 

20 d) a polypeptide that comprises a sequence of amino acids having 

at least about 60% sequence identity with the sequence of amino acids of a 
polypeptide of a), b) or c); and 

e) a polypeptide encoded by a sequence of nucleotides that is a 
splice variant of a sequence of nucleotides that encodes any of the polypeptides 

25 of a)-d). 

61. A method for identifying activators of the zymogen form of an 
MTSP25, comprising: 

contacting a zymogen form of an MTSP25 polypeptide of any of 
claims 1-14 or the MTSP25 of SEQ ID No. 16 or potentially a proteolytically 
30 active portion thereof with a substrate of the activated form of the polypeptide; 

adding a test compound, wherein the test compound is added 
before, after or simultaneously with the addition of the substrate; and 



WO 03/031585 



PCT/US02/32417 



-171- 

detecting cleavage of the substrate, thereby identifying 
compounds that activate the zymogen. 

62. The method of claim 61, wherein the substrate is a chromogenic 
substrate. 

5 63. The method of claim 61 or claim 62, wherein the substrate is a 

L-pyroglutamyl-L-prolyl-L-arginine-p-nitroaniline hydrochloride. 

64. The method of any of claims 61-63, wherein the test compound is 
a small molecule, a nucleic acid or a polypeptide. 

65. A method for treating or preventing a neoplastic disease, in a 
10 mammal, comprising administering to a mammal an effective amount of an 

inhibitor of the protease activity of an MTSP25 polypeptide. 

66. The method of claim 65, wherein the inhibitor is an antibody that 
specifically binds to the polypeptide, or a fragment or derivative of the antibody 
containing a binding domain thereof, wherein the antibody is a polyclonal 

15 antibody or a monoclonal antibody. 

,67. A method of inhibiting tumor initiation, growth or progression or 
treating a malignant or pre-malignant condition, comprising administering an 
agent that inhibits activation cleavage of the zymogen form of an MTSP25 
polypeptide that comprises a polypeptide of any of claims 1-14 or inhibits an 

20 activity of an activated form of MTSP25 or a proteolytically active portion 
thereof. 

68. The method of claim 67, wherein the condition is a condition of 
the breast, cervix, prostate, lung, ovary or colon. 

69. The method of claim 67 or claim 68, wherein the agent is an 
25 antisense oligonucleotide, double-stranded RNA (dsRISIA) or an antibody. 

* 

70. The method of any of claims 67-69, further comprising 
administering another treatment or agent selected from anti-tumor and anti- 
angiogenic treatments or agents. 

71. A method of identifying a compound that binds to the single-chain 
30 and/or two-chain form of an MTSP25 polypeptide and/or to a proteolytically 

active portion of a single-chain and/or two-chain form of an MTSP25 polypeptide 
of any of claims 1-14 or an MTSP25 of SEQ ID No. 1 6„ comprising: 
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contacting a test compound with both forms; 

determining to which form the compound binds; and 

if it binds to a form of polypeptide, further determining whether the 

compound has at least one of the following properties: 
5 (i) inhibits activation cleavage of the single-chain zymogen form of 

polypeptide; 

(ii) inhibits activity of the two-chain or single-chain form; and 

(iii) inhibits dimerization of the polypeptide. 

72. The method of claim 71, wherein both forms consist essentially of 
10 the protease domain. 

73. A method of detecting neoplastic disease, comprising: detecting a 

» 

polypeptide that comprises a polypeptide of any of claims 1-14 in a biological 
sample, wherein the amount, the form and/or activity detected differs from the 
amount, the form and/or activity of polypeptide detected from a subject who 
15 does not have neoplastic disease. 

74. The method of claim 73, wherein the biological sample is selected 
from the group consisting of blood, urine, saliva, tears, synovial fluid, sweat, 
interstitial fluid, sperm, cerebrospinal fluid, ascites fluid, tumor tissue biopsy and 
circulating tumor cells. 

20 75. A method of diagnosing the presence of a pre-malignant lesion, a 

malignancy, or other pathologic condition in a subject, comprising: 
obtaining a biological sample from the subject; and 

exposing it to a detectable agent that binds to a two-chain and/or single- 
chain form of an MTSP25 polypeptide, wherein: 
25 the pathological condition is characterized by the presence or absence of 

the two-chain or single-chain form; and 

the MTSP25 polypeptide comprises a polypeptide of any of claims 1-14 
or an MTSP25 of SEQ ID No. 1 6. 

76. A method of monitoring tumor progression and/or therapeutic 
30 -effectiveness, comprising detecting and/or quantifying the level, the form and/or 
activity of an MTSP25 polypeptide in a body tissue or fluid sample, wherein: 
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the MTSP25 polypeptide comprises a polypeptide of any of claims 1-14 
or an MTSP25 of SEQ ID No. 1 6. 

77. The method of claim 75 or claim 76, wherein the tumor is a tumor 
of the breast, cervix, prostate, lung, ovary or colon. 
5 78. The method of any of claims 75-77, wherein the body fluid is 

blood, urine, sweat, saliva, cerebrospinal fluid and synovial fluid. 

79. A method for identifying compounds that modulate the protease 
activity of an MTSP25 polypeptide, comprising: 

contacting a polypeptide of any of claims 1-14 or a proteolytically active 
10 portion thereof with a substrate that is proteolytically cleaved by the 

polypeptide, and, either simultaneously, before or after, adding a test compound 
or plurality thereof; 

measuring the amount of substrate cleaved in the presence of the test 
compound; and 

15 selecting compounds that change the amount of substrate cleaved 

compared to a control, whereby compounds that modulate the activity of the 
polypeptide are identified. 

80. A transgenic non-human animal, comprising heterologous nucleic 
acid encoding an MTSP25 polypeptide. 

20 81. A transgenic non-human animal, comprising heterologous nucleic 

acid encoding an MTSP25 polypeptide, wherein the polypeptide comprises a 
polypeptide of any of claims 1-15. 
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SEQUENCE LISTING 

<110> Corvas International, Inc. 
Madison, Edwin 
Yen, Jiunn-Chern 

<120> NUCLEIC ACID MOLECULES ENCODING A TRANSMEMBRANE SERINE PROTEASE 25, THE 
ENCODED POLYPEPTIDES AND METHODS BASED THEREON 

<130> 24745-1621PC 

<140> Not Yet Assigned 
<141> Herewith 

<150> 60/328,530 
<151> 09-OCT-2001 

<160> 18 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 3147 

<212> DNA 

<213> Homo Sapien 

<220> 
<221> CDS 

<222> (23) . . . (2589) 

<223> Nucleotide sequence encoding MTSP1 
<300> 

<301> O'Brien, T.J. and Tanimoto, H. 

<308> GenBank #AR081724 

<309> 2000-08-31 

<310> 5,972,616 

<311> 1998-02-20 

<312> 1999-10-26 

<400> 1 

tcaagagcgg cctcggggta cc atg ggg age gat egg gee cgc aag ggc gga 52 

Met Gly Ser Asp Arg Ala Arg Lys Gly Gly 
1 5 10 

999 ggc ccg aag gac ttc ggc gcg gga etc aag tac aac tec egg cac 100 
Gly Gly Pro Lys Asp Phe Gly Ala Gly Leu Lys Tyr Asn Ser Arg His 

15 * 20 ~ 25 

gag aaa gtg aat ggc ttg gag gaa ggc gtg gag ttc ctg cca gtc aac 148 
Glu Lys Val Asn Gly Leu Glu Glu Gly Val Glu Phe Leu Pro Val Asn 

30 35 40 

aac gtc aag aag gtg gaa aag cat ggc ccg ggg cgc tgg gtg gtg ctg 196 
Asn Val Lys Lys Val Glu Lys His Gly Pro Gly Arg Trp Val Val Leu 
45 50 55 

gca gec gtg ctg ate ggc etc etc ttg gtc ttg ctg ggg ate ggc ttc 244 
Ala Ala Val Leu lie Gly Leu Leu Leu Val Leu Leu Gly lie Gly Phe 
60 65 70 

ctg gtg tgg cat ttg cag tac egg gac gtg cgt gtc cag aag gtc ttc 2 92 

'Leu Val Trp His Leu Gin Tyr Arg Asp Val Arg Val Gin Lys Val Phe 
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75 80 85 90 

aat ggc tac atg agg ate aca aat gag aat ttt gtg gat gec tac gag 34 0 

Asn Gly Tyr Met Arg lie Thr Asn Glu Asn Phe Val Asp Ala Tyr Glu 

95 100 105 

aac tec aac tec act gag ttt gta age etg gec age aag gtg aag gac 388 
Asn Ser Asn Ser Thr Glu Phe Val Ser Leu Ala Ser Lys Val Lys Asp 

110 115 120 

gcg ctg aag ctg ctg tac age gga gtc cca ttc ctg ggc ccc tac cac 43 6 

Ala Leu Lys Leu Leu Tyr Ser Gly Val Pro Phe Leu Gly Pro Tyr His 
125 130 135 

aag gag teg get gtg acg gec ttc age gag ggc age gtc ate gec tac 484 
Lys Glu Ser Ala Val Thr Ala Phe Ser Glu Gly Ser Val He Ala Tyr 
140 145 150 

tac tgg tct gag ttc age ate ccg cag cac ctg gtg gag gag gee gag 532 
Tyr Trp Ser Glu Phe Ser He Pro Gin His Leu Val Glu Glu Ala Glu 
155 160 165 170 

cgc gtc atg gec gag gag cgc gta gtc atg ctg ccc ccg egg gcg cgc 580 
Arg Val Met Ala Glu Glu Arg Val Val Met Leu Pro Pro Arg Ala Arg 

175 180 185 

tec ctg aag tec ttt gtg gtc aec tea gtg gtg get ttc ccc acg gac 628 
Ser Leu Lys Ser Phe Val Val Thr Ser Val Val Ala Phe Pro Thr Asp 

190 195 200 

tec aaa aca gta cag agg acc cag gac aac age tgc age ttt ggc ctg 676 
Ser Lys Thr Val Gin Arg Thr Gin Asp Asn Ser Cys Ser Phe Gly Leu 
205 210 215 

cac gec cgc ggt gtg gag ctg atg cgc ttc acc acg ccc ggc ttc cct 724 
His Ala Arg Gly Val Glu Leu Met Arg Phe Thr Thr Pro Gly Phe Pro 
220 225 230 

gac age ccc tac ccc get cat gec cgc tgc cag tgg gec ctg egg ggg 772 
Asp Ser Pro Tyr Pro Ala His Ala Arg Cys Gin Trp Ala Leu Arg Gly 
235 240 245 250 

gac gee gac tea gtg ctg age etc acc ttc cgc age ttt gac ctt gcg 82 0 

Asp Ala Asp Ser Val Leu Ser Leu Thr Phe Arg Ser Phe Asp Leu Ala 

255 260 265 

tec tgc gac gag cgc ggc age gac ctg gtg acg gtg tac aac acc ctg 868 
Ser Cys Asp Glu Arg Gly Ser Asp Leu Val Thr Val Tyr Asn Thr Leu 

270 275 280 

age ccc atg gag ccc cac gec ctg gtg cag ttg tgt ggc acc tac cct 916 
Ser Pro Met Glu Pro His Ala Leu Val Gin Leu Cys Gly Thr Tyr Pro 
285 290 * 295 

ccc tec tac aac ctg acc ttc cac tec tec cag aac gtc ctg etc ate 964 
Pro Ser Tyr Asn n Leu Thr Phe His Ser Ser Gin Asn Val Leu Leu He 



t n n 
-> \j \j 



305 310 



aca ctg ata acc aac act gag egg egg cat ccc ggc ttt gag gee 



acc 1012 
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Thr Leu lie Thr Asn Thr Glu Arg Arg His Pro Gly Phe Glu Ala Thr 
315 320 325 330 

ttc ttc cag ctg cct agg atg age age tgt gga ggc cgc tta cgt aaa 1060 
Phe Phe Gin Leu Pro Arg Met Ser Ser Cys Gly Gly Arg Leu Arg Lys 

335 340 ' 345 

gec cag ggg aca ttc aac age ccc tac tac cca ggc cac tac cca ccc 1108 
Ala Gin Gly Thr Phe Asia Ser Pro Tyr Tyr Pro Gly His Tyr Pro Pro 

350 355 360 

aac att gac tgc aca tgg aac att gag gtg ccc aac aac cag cat gtg 1156 
Asn lie Asp Cys Thr Trp Asn lie Glu Val Pro Asn Asn Gin His Val 
365 370 375 

aag gtg age ttc aaa ttc ttc tac ctg ctg gag ccc ggc gtg cct gcg 1204 
Lys Val Ser Phe Lys Phe Phe Tyr Leu Leu Glu Pro Gly Val Pro Ala 
380 385 390 

ggc acc tgc ccc aag gac tac gtg gag ate aat ggg gag aaa tac tgc 1252 
Gly Thr Cys Pro Lys Asp Tyr Val Glu lie Asn Gly Glu Lys Tyr Cys 
395 400 405 ~ 410 

gga gag agg tec cag ttc gtc gtc acc age aac age aac aag ate aca 13 00 

Gly Glu Arg Ser Gin Phe Val Val Thr Ser Asn Ser Asn Lys lie Thr 

415 420 ' 425 

gtt cgc ttc cac tea gat cag tec tac acc gac acc ggc ttc tta get 1348 
Val Arg Phe His Ser Asp Gin Ser Tyr Thr Asp Thr Gly Phe Leu Ala 

430 435 440 

• 

gaa tac etc tec tac gac tec agt gac cca tgc ccg ggg cag ttc acg 1396 
Glu Tyr Leu Ser Tyr Asp Ser Ser Asp Pro Cys Pro Gly Gin Phe Thr 
445 450 455 

» 

tgc cgc acg ggg egg tgt ate egg aag gag ctg cgc tgt gat ggc tgg 1444 
Cys Arg Thr Gly Arg Cys lie Arg Lys Glu Leu Arg Cys Asp Gly Trp 
460 465 470 

gee gac tgc acc gac cac age gat gag etc aac tgc agt tgc gac gec 1492 
Ala Asp Cys Thr Asp His Ser Asp Glu Leu Asn Cys Ser Cys Asp Ala 
475 480 485 * 490 

ggc cac cag ttc acg tgc aag aac aag ttc tgc aag ccc etc ttc tgg 1540 
Gly His Gin Phe Thr Cys Lys Asn Lys Phe Cys Lys Pro Leu Phe Trp 

495 - 500 505 

gtc tgc gac agt gtg aac gac tgc gga gac aac age gac gag cag ggg 158 8 

Val Cys Asp Ser Val Asn Asp Cys Gly Asp Asn Ser Asp Glu Gin Gly 

510 * ~ 515 520 

tgc agt tgt ccg gee cag acc ttc agg tgt tec aat ggg aag tgc etc 163 6 

Cys Ser Cys Pro Ala Gin Thr Phe Arg Cys Ser Asn Gly Lys Cys Leu 
525 530 ~ ^ 535 

teg aaa age cag cag tgc aat ggg aag gac gac tgt ggg gac ggg tec 1684 
Ser Lys Ser Gin Gin Cys Asn Gly Lys Asp Asp Cys Gly Asp Gly Ser 
540 . "* 545 * " 550 
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gac gag gcc tec tgc ccc aag gtg aac gtc gtc act tgt acc aaa cac 1732 
Asp Glu Ala Ser Cys Pro Lys Val Asn Val Val Thx Cys Thr Lys His 
555 560 565 * ~ 570 

acc tac cgc tgc etc aat ggg etc tgc ttg age aag ggc aac cct gag 1780 
Thr Tyr Arg Cys Leu Asn Gly Leu Cys Leu Ser Lys Gly Asn Pro Glu 

575 580 ^ "* 5B5 

tgt gac ggg aag gag gac tgt age gac ggc tea gat gag aag gac tgc 1828 
Cys Asp Gly Lys Glu Asp Cys Ser Asp Gly Ser Asp Glu Lys Asp Cys 

59 0 595 600 

gac tgt ggg ctg egg tea ttc acg aga cag get cgt gtt gtt ggg ggc 1876 
Asp Cys Gly Leu Arg Ser Phe Thr Arg Gin Ala Arg Val Val Gly Gly 
605 610 615 

acg gat gcg gat gag ggc gag tgg ccc tgg cag gta age ctg cat get 1924 
Thr Asp Ala Asp Glu Gly Glu Trp Pro Trp Gin Val Ser Leu His Ala 
620 625 630 

ctg ggc cag ggc cac ate tgc ggt get tec etc ate tct ccc aac tgg 1972 
Leu Gly Gin Gly His lie Cys Gly Ala Ser Leu lie Ser Pro Asn 'Trp 
635 640 645 650 

ctg gtc tct gcc gca cac tgc tac ate gat gac aga gga ttc agg tac 2020 
Leu Val Ser Ala Ala His Cys Tyr lie Asp Asp Arg Gly Phe Arg Tyr 

655 660 665 

tea gac ccc acg cag tgg acg gcc ttc ctg ggc ttg cac gac cag age 206 8 

Ser Asp Pro Thr Gin Trp Thr Ala Phe Leu Gly Leu His Asp Gin Ser 

670 675 680 

cag cgc age gcc cct ggg gtg cag gag cgc agg etc aag cgc ate ate 2116 
Gin Arg Ser Ala Pro Gly Val Gin Glu Arg Arg Leu Lys Arg lie He 
685 690 695 

tec cac ccc ttc. ttc aat gac ttc acc ttc gac tat gac ate gcg ctg 2164 
Ser His Pro Phe Phe Ash Asp Phe Thr Phe Asp Tyr Asp He Ala Leu 
700 70S 710 

ctg gag ctg gag aaa ccg gca gag tac age tec atg gtg egg ccc ate 2212 
Leu Glu Leu Glu Lys Pro Ala Glu Tyr Ser Ser Met Val Arg Pro He 
715 720 725 ~ 730 

tgc ctg ccg gac gcc tec cat gtc ttc cct gcc ggc aag gcc ate tgg 2260 
Cys Leu Pro Asp Ala Ser His Val Phe Pro Ala Gly Lys Ala He Trp 

735 740 " ^ 745 

gtc acg ggc tgg gga cac acc cag tat gga ggc act ggc gcg ctg ate 2308 
Val Thr Gly Trp Gly His Thr Gin Tyr Gly Gly Thr Gly Ala Leu He 

750 755 760 

ctg caa aag ggt gag ate cgc gtc ate aac cag acc acc tgc gag aac 2356 
Leu Gin Lys Gly Glu He Arg Val He Asn Gin Thr Thr Cys Glu Asn 
765 770 775 

etc ctg ccg cag cag ate acg ccg cgc atg atg tgc gtg gac ttc etc 24 04 

Leu Leu Pro Gin Gin He Thr Pro Arg Met Met Cys Val Gly Phe Leu 
780 785 790 
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age ggc ggc gtg gac tec tgc cag ggt gat tec ggg gga ccc ctg tec 2452 
Ser Gly Gly Val Asp Ser Cys Gin Gly Asp Ser Gly Gly Pro Leu Ser 
795 800 805 810 

age gtg gag gcg gat ggg egg ate ttc cag gee ggt gtg gtg age tgg 250 0 

Ser Val Glu Ala Asp Gly Arg lie Phe Gin Ala Gly Val Val Ser Trp 

815 820 825 

gga gac ggc tgc get cag agg aac aag cca ggc gtg tac aca agg etc 2548 
Gly Asp Gly Cys Ala Gin Arg Asn Lys Pro Gly Val Tyr Thr Arg Leu 

830 835 " 840 

cct ctg ttt egg gac tgg ate aaa gag aac act ggg gta ta ggggccgggg 2599 
Pro Leu Phe Arg Asp Trp lie Lys Glu Asn Thr Gly Val 
845 850 " 855 

ccacccaaat gtgtacacct gcggggccac ccatcgtcca ccccagtgtg cacgcctgca 2 659 

ggctggagac tggaccgctg actgcaccag cgcccccaga acatacactg tgaactcaat 2719 

ctccagggct ccaaatctgc ctagaaaacc tctcgcttcc tcagcctcca aagtggagct 2779 

gggaggtaga aggggaggac actggtggtt ctactgaccc aactgggggc aaaggtttga 2 83 9 

agacacagcc tcccccgcca gccccaagct gggecgagge gcgtttgtgt atatctgect 2 899 

cccctgtctg taaggagcag egggaaegga getteggage ctcctcagtg aaggtggtgg 2959 

ggetgccgga tctgggctgt ggggcccttg ggccacgctc ttgaggaagc ccaggctcgg 3019 

aggaccctgg aaaacagacg ggtctgagac tgaaattgtt ttaccagctc. ccagggtgga 3 079 

cttcagtgtg tgtatttgtg taaatgggta aaacaattta tttcttttta aaaaaaaaaa 313 9 

aaaaaaaa 3147 

<210> 2 

<211> 855 

<212> PRT 

<213> Homo Sapien 



<400> 2 






























Met 


Gly 


Ser 


Asp 


Arg 


Ala 


Arg 


Lys 


Gly 


Gly 


Gly Gly Pro 


Lys 


Asp 


Phe 


1 








5 










10 










15 




Gly 


Ala 


Gly 


Leu 


Lys 


Tyr 


Asn 


Ser 


Arg 


His 


Glu 


Lys 


Val 


Asn 


Gly Leu 








20 










25 










30 






Glu 


Glu 


Gly 


Val 


Glu 


Phe 


Leu 


Pro 


Val 


Asn 


Asn 


Val 


Lys 


Lys 


Val 


Glu 






35 










40 










45 






Lys 


His 


Gly 


Pro 


Gly 


Arg 


Trp 


Val 


Val 


Leu 


Ala 


Ala 


Val 


Leu 


He 


Gly 




50 










55 










60 








Leu 


Leu 


Leu 


Val 


Leu 


Leu 


Gly 


He 


Gly 


Phe 


Leu 


Val 


Trp 


His 


Leu 


Gin 


65 










70 










75 








80 


Tyr 


Arg 


Asp 


Val 


Arg 


Val 


Gin 


Lys 


Val 


Phe Asn Gly Tyr Met Arg 


He 










85 










90 










95 




Thr 


Asn 


Glu 


Asn 


Phe 


Val 


Asp 


Ala 


Tyr 


Glu 


Asn 


Ser 


Asn 


Ser 


Thr 


Glu 








100 










105 










110 






Phe 


Val 


Ser 


Leu 


Ala 


Ser 


Lys 


Val 


Lys 


Asp 


Ala 


Leu 


Lys 


Leu 


Leu 


Tyr 






115 










12 0 










125 






Ser 


Gly 


Val 


Pro 


Phe 


Leu 


Gly 


Pro 


Tyr 


His 


Lys 


Glu 


Ser 


Ala 


Val 


Thr 




130 










135 










140 










Ala 


Phe 


Ser 


Glu 


Gly 


Ser 


Val 


He 


Ala 


Tyr 


Tyr 


Trp 


Ser 


Glu 


Phe 


Ser 


145 










150 








155 










160 


lie 


Pro 


Gin 


His 


Leu 


Val 


Glu 


Glu 


Ala 


Glu 


Arg 


Val 


Met 


Ala 


Glu 


Glu 










165 










170 










175 




Arg 


Val 


v a _u 


Met 


Leu 


Pro 




Arg 


Ala 


Arg 


Ser 


Leu 


Lys 


Ser 


Phe 


Val 








180 










185 






190 






Val 


Thr 


Ser 


Val 


Val 


Ala 


Phe 


Pro 


Thr 


Asp 


Ser 


Lys 


Thr 


Val 


Gin 


Arg 
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195 200 205 

Thr Gin Asp Asn Ser Cys Ser Phe Gly Leu His Ala Arg Gly Val Glu 

210 215 220 

Leu Met Arg Phe Thr Thr Pro Gly Phe Pro Asp Ser Pro Tyr Pro Ala 
225 230 235 240 

His Ala Arg Cys Gin Trp Ala Leu Arg Gly Asp Ala Asp Ser Val Leu 

245 250 ~ 255 

Ser Leu Thr Phe Arg Ser Phe Asp Leu Ala Ser Cys Asp Glu Arg Gly 

260 265 270 

Ser Asp Leu Val Thr Val Tyr Asn Thr Leu Ser Pro Met Glu Pro His 

275 280 285 

Ala Leu Val Gin Leu Cys Gly Thr Tyr Pro Pro Ser Tyr Asn Leu Thr 

290 295 300 

Phe His Ser Ser Gin Asn Val Leu Leu lie Thr Leu lie Thr Asn Thr 
305 310 315 320 

Glu Arg Arg His Pro Gly Phe Glu Ala Thr Phe Phe Gin Leu Pro Arg 

325 330 335 

Met Ser Ser Cys Gly Gly Arg Leu Arg Lys Ala Gin Gly Thr Phe Asn 

340 345 350 

Ser Pro Tyr Tyr Pro Gly His Tyr Pro Pro Asn lie Asp Cys Thr Trp 

355 360 365 

Asn lie Glu Val Pro Asn Asn Gin His Val Lys Val Ser Phe Lys Phe 

370 375 380 

Phe Tyr Leu Leu Glu Pro Gly Val Pro Ala Gly Thr Cys Pro Lys Asp 
385 390 395 400 

Tyr Val Glu lie Asn Gly Glu Lys Tyr Cys Gly Glu Arg Ser Gin Phe 

405 410 415 

Val Val Thr Ser Asn Ser Asn Lys lie Thr Val Arg Phe His Ser Asp 

420 425 ~ 430 

Gin Ser Tyr Thr Asp Thr Gly Phe Leu Ala Glu Tyr Leu Ser Tyr Asp 

435 440 445 

Ser Ser Asp Pro Cys Pro Gly Gin Phe Thr Cys Arg Thr Gly Arg Cys 

450 455 460 

lie Arg Lys Glu Leu Arg Cys Asp Gly Trp. Ala Asp Cys Thr Asp His 
465 470 475 * 480 

Ser Asp Glu Leu Asn Cys Ser Cys Asp Ala Gly His Gin Phe Thr Cys 

485 490 " 495 

Lys Asn Lys Phe Cys Lys Pro Leu Phe Trp Val Cys Asp Ser Val Aan 

500 505 " * 510 

Asp Cys Gly Asp Asn Ser Asp Glu Gin Gly Cys Ser Cys Pro Ala Gin 

515 520 525 

Thr Phe Arg Cys Ser Asn Gly Lys Cys Leu Ser Lys Ser Gin Gin Cys 

530 535 540 

Asn Gly Lys Asp Asp Cys Gly Asp Gly Ser Asp Glu Ala Ser Cys Pro 
545 550 555 560 

Lys Val Asn Val Val Thr Cys Thr Lys His Thr Tyr Arg Cys Leu Asn 

565 570 575 

Gly Leu Cys Leu Ser Lys Gly Asn Pro Glu Cys Asp Gly Lys Glu Asp 

580 585 " 590 

Cys Ser Asp Gly Ser Asp Glu Lys Asp Cys Asp Cys Gly Leu Arg Ser 

595 600 60S 

Phe Thr Arg Gin Ala Arg Val Val Gly Gly Thr Asp Ala Asp Glu Gly 

610 615 620 

Glu Trp Pro Trp Gin Val Ser Leu His Ala Leu Gly Gin Gly His lie 
^25 630 635 * 640 

Cys Gly Ala Ser Leu He Ser Pro Asn Trp Leu Val Ser Ala Ala His 

645 650 655 

Cys Tyr He Asp Asp Arg Gly Phe Arg Tyr Ser Asp Pro Thr Gin Trp 

660 665 670 



i 



t 
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> 



inr 


n-La 


Phe 


Leu Gly Leu His Asp 


Gin 


Ser 


Gin Arg 


Ser Ala Pro 


Gly 






675 


680 










685 


Val 


Gin 


Glu 


Arg Arg Leu Lys Arg 


He 


lie. 


Ser 


His 


Pro Phe Phe 


Asn 




690 




695 








700 




Asp 


Phe 


Thr 


Phe Asp Tyr Asp He 


Ala 


Leu 


Leu 


Glu 


Leu Glu Lys 


Pro 


705 






710 






715 




720 


Ala 


Glu 


Tyr 


Ser Ser Met Val Arg 


Pro 


He 


Cys 


Leu 


Pro Asp Ala 


Ser 


His 


Val 




725 




73 0 






735 




Phe 


Pro Ala Gly Lys Ala 


He 


Trp 


Val 


Thr 


Gly Trp Gly His 


Thr 






740 


745 








750 




Gin 


Tyr Gly Gly Thr Gly Ala 


Leu 


He 


Leu 


Gin 


Lys Gly Glu 


He 






755 


760 










765 




Arg 


Val 


He 


Asn Gin Thr Thr Cys 


Glu 


Asn 


Leu 


Leu 


Pro Gin Gin 


He 


Thr 


770 




775 








780 






Pro 


Arg Met Met Cys Val Gly 


Phe 


Leu 


Ser 


Gly 


Gly Val Asp 


Ser 


785 


Gin 




790 






795 


800 


Cys 


Gly Asp Ser Gly Gly Pro 


Leu 


Ser 


Ser 


Val 


Glu Ala Asp 


Gly 








805 




810 






815 


Arg 


He 


Phe 


Gin Ala Gly Val Val 


Ser Trp 


Gly Asp 


Gly Cys Ala 


Gin 








820 


825 








830 




Arg Asn 


Lys 


Pro Gly Val Tyr Thr 


Arg 


Leu 


Pro 


Leu 


Phe Arg Asp 


Trp 






835 


840 










845 


He 


Lys 


Glu 


Asn Thr Gly Val 
















850 




855 














<210> 3 



















<211> 3147 

<212> DNA 

<213> Homo Sapien 

<220> 
<221> CDS 

<222> (1865) . . . (2590) 

<223> Nucleic acid sequence of protease domain of MTSP1 
<400> 3 

tcaagagcgg cctcggggta ccatggggag cgatcgggcc cgcaagggcg gagggggccc 60 

gaaggacttc ggcgcgggac tcaagtacaa ctcccggcac gagaaagtga atggcttgga 120 

ggaaggcgtg gagttcctgc cagtcaacaa cgtcaagaag gtggaaaagc atggcccggg 180 

gcgctgggtg gtgctggcag ccgtgctgat cggcctcctc ttggtcttgc tggggatcgg 240 
cttcctggtg tggcatttgc agtaccggga cgtgcgtgtc cagaaggtct tcaatggcta . 300 

catgaggatc acaaatgaga attttgtgga tgcctacgag aactccaact ccactgagtt 360 

tgtaagcctg gccagcaagg tgaaggacgc gctgaagctg ctgtacagcg gagtcccatt 420 

cctgggcccc taccacaagg agtcggctgt gacggccttc agcgagggca gcgtcatcgc 4 80 

ctactactgg tctgagttca gcatcccgca gcacctggtg gaggaggccg agcgcgtcat 540 

ggccgaggag cgcgtagtca tgctgccccc gcgggcgcgc tccctgaagt cctttgtggt 600 

cacctcagtg gtggctttcc ccacggactc caaaacagta cagaggaccc aggacaacag 660 

ctgcagcttt ggcctgcacg cccgcggtgt ggagctgatg cgcttcacca cgcccggctt 720 

ccctgacagc ccctaccccg ctcatgcccg ctgccagtgg gccctgcggg gggacgccga 780 

ctcagtgctg agcctcacct tccgcagctt tgaccttgcg tcctgcgacg agcgcggcag 840 

cgacctggtg acggtgtaca acaccctgag ccccatggag ccccacgccc tggtgcagtt 900 

gtgtggcacc taccctccct cctacaacct gaccttccac tcctcccaga acgtcctgct 960 

catcacactg ataaccaaca ctgagcggcg gcatcccggc tttgaggcca ccttcttcca 1020 

gctgcctagg atgagcagct gtggaggccg cttacgtaaa gcccagggga cattcaacag 1080 

cccctactac ccaggccact acccacccaa cattgactgc acatggaaca ttgaggtgcc 1140 

caacaaccag catgtgaagg tgagcttcaa attcttctac ctgctggagc ccggcgtgcc 12 00 

tgcgggcacc tgccccaagg actacgtgga gatcaatggg gagaaatact gcggagagag 1260 

gtcccagttc gtcgtcacca gcaacagcaa caagatcaca gtfccgcttcc actcagatca 1320 

gtcctacacc gacaccggct tcttagctga atacctctcc tacgactcca gtgacccatg 1380 
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cccggggcag ttcacgtgcc gcacggggcg gtgtatccgg aaggagctgc gctgtgatgg 144 0 

ctgggccgac tgcaccgacc acagcgatga gctcaactgc agttgcgacg ccggccacca 1500 

gttcacgtgc aagaacaagt tctgcaagcc cctcttctgg gtctgcgaca gtgtgaacga 1560 

ctgcggagac aacagcgacg agcaggggtg cagttgtccg gcccagacct tcaggtgttc 162 0 

caatgggaag tgcctctcga aaagccagca gtgcaatggg aaggacgact gtggggacgg 168 0 

gtccgacgag gcctcctgcc ccaaggtgaa cgtcgtcact tgtaccaaac acacctaccg 1740 

ctgcctcaat gggctctgct tgagcaaggg caaccctgag tgtgacggga aggaggactg 180 0 

tagcgacggc tcagatgaga aggactgcga ctgtgggctg cggtcattca cgagacaggc 1860 

tcgt gtt gtt ggg ggc acg gat gcg gat gag ggc gag tgg ccc tgg cag 190 9 
Val Val Gly Gly Thr Asp Ala Asp Glu Gly Glu Trp Pro Trp Gin 
15 10 15 

gta age ctg cat get ctg ggc cag ggc cac ate tgc ggt get tec etc 1957 
Val Ser Leu His Ala Leu Gly Gin Gly His lie Cys Gly Ala Ser Leu 

20 25 30 



ate tct ccc aac tgg ctg gtc tct gee gca cac tgc tac ate gat gac 
lie Ser Pro Asn Trp Leu Val Ser Ala Ala His Cys Tyr lie Asp Asp 

35 40 45 



ttg cac gac cag age cag cgc age gec cct ggg gtg cag gag cgc agg 
Leu His Asp Gin Ser Gin Arg Ser Ala Pro Gly Val Gin Glu Arg Arg 
65 70 75 



2005 



a 9 a gga ttc agg tac tea gac ccc acg cag tgg acg gec ttc ctg ggc 2 053 

Arg Gly Phe Arg Tyr Ser Asp Pro Thr Gin Trp Thr Ala Phe Leu Gly 
50 55 60 



2101 



etc aag cgc ate ate tec cac ccc ttc ttc aat gac ttc acc ttc gac 2149 
Leu Lys Arg lie He Ser His Pro Phe Phe Asn Asp Phe Thr Phe Asp 
80 85 90 " 95 

tat gac ate gcg ctg ctg gag ctg gag aaa ccg gca gag tac age tec 2197 
Tyr Asp He Ala Leu Leu Glu Leu Glu Lys Pro Ala Glu Tyr Ser Ser 

100 105 no 

at 9 gtg egg ccc ate tgc ctg ccg gac gee tec cat gtc ttc cct gec 2245 
Met Val Arg Pro He Cys Leu Pro Asp Ala Ser His Val Phe Pro Ala 

115 120 125 

ggc aag gec ate tgg gtc acg ggc tgg gga cac acc cag tat gga ggc 22 93 

Gly Lys Ala lie Trp Val Thr Gly Trp Gly His Thr Gin Tyr Gly Gly 
130 135 140 

act ggc gcg ctg ate ctg caa aag ggt gag ate cgc gtc ate aac cag 2341 
Thr Gly Ala Leu He Leu Gin Lys Gly Glu He Arg Val He Asn Gin 
145 150 155 

acc acc tgc gag aac etc ctg ccg cag cag ate acg ccg cgc atg atg 2389 
Thr Thr Cys Glu Asn Leu Leu Pro Gin Gin He Thr Pro Arg Met Met 
160 165 170 175 

tgc gtg ggc ttc etc age ggc ggc gtg gac tec tgc cag ggt gat tec 2437 
Cys Val Gly Phe Leu Ser Gly Gly Val Asp Ser Cys Gin Gly Asp Ser 

180 185 190 

ggg gga ccc ctg tec age gtg gag gcg gat ggg egg ate ttc cag gec 2485 
^riy Gxy Pro Leu Ser Ser Val Glu Ala Asp Gly Arg He Phe Gin Ala 

195 200 ~ 205 
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ggt gtg gtg age tgg gga gac ggc tgc get cag agg aac aag cca ggc .2533 
Gly Val Val Ser Trp Gly Asp Gly Cys Ala Gin Arg Asn Lys Pro Gly 
210 215 220 

gtg tac aca agg etc cct ctg ttt egg gac tgg ate aaa gag aac act 2581 
Val Tyr Thr Arg Leu Pro Leu Phe Arg Asp Trp lie Lys Glu Asn Thr 
225 230 235 

999 gta tag gggcegggge cacccaaatg tgtacacctg cggggccacc 263 0 

Gly Val * 

240 

catcgtccac cccagtgtgc acgcctgcag gctggagact ggaccgctga ctgcaccagc 2690 

gcccccagaa catacactgt gaactcaatc tccagggctc caaatctgcc tagaaaacct 275 0 

ctcgcttcct cagcctccaa agtggagctg ggaggtagaa ggggaggaca ctggtggttc 2810 

tactgaccca actgggggca aaggtttgaa gacacagcct cccccgccag ccccaagctg 2870 

ggccgaggcg cgtttgtgta tatctgcctc ccctgtctgt aaggagcagc gggaaeggag 293 0 

cttcggagcc tcctcagtga aggtggtggg getgeeggat ctgggctgtg gggcccttgg 2990 

gccacgctct tgaggaagee caggctegga ggaccctgga aaacagaegg gtctgagact 3 05 0 

gaaattgttt taccagctcc cagggtggac ttcagtgtgt gtatttgtgt aaatgggtaa 3110 

aacaatttat ttctttttaa aaaaaaaaaa aaaaaaa 3147 

<210> 4 

<211> 241 

<212> PRT 

<213> Homo Sapien 

<400> 4 

Val Val Gly Gly Thr Asp Ala Asp Glu Gly Glu Trp Pro Trp Gin Val 

15 10 15 

Ser Leu His Ala Leu Gly Gin Gly His lie Cys Gly Ala Ser Leu lie 

20 25 30 

Ser Pro Asn Trp Leu Val Ser Ala Ala His Cys Tyr lie Asp Asp Arg 

35 40 ~ 45 

Gly Phe Arg Tyr Ser Asp Pro Thr Gin Trp Thr Ala Phe Leu Gly Leu 

50 -55 60 

His Asp Gin Ser Gin Arg Ser Ala Pro Gly Val Gin Glu Arg Arg Leu 

65 _ 70 75 80 

Lys Arg lie He Ser His Pro Phe Phe Asn Asp Phe Thr Phe Asp Tyr 

85 90 95 

Asp He Ala Leu Leu Glu Leu Glu Lys Pro Ala Glu Tyr Ser Ser Met 

100 105 no 

Val Arg Pro He Cys Leu Pro Asp Ala Ser His Val Phe Pro Ala Gly 

115 - 120 125 

Lys Ala He Trp Val Thr Gly Trp Gly His Thr Gin Tyr Gly Gly Thr 

130 135 140 

Gly Ala Leu He Leu Gin Lys Gly Glu He Arg Val He Asn Gin Thr 

145 150 155 160 

Thr Cys Glu Asn Leu Leu Pro Gin Gin He Thr Pro Arg Met Met Cys 

165 170 175 

Val Gly Phe Leu Ser Gly Gly Val Asp Ser Cys Gin Gly Asp Ser Gly 

180 185 190 

Gly Pro Leu Ser Ser Val Glu Ala Asp Gly Arg He Phe Gin Ala Gly 

195 200 " 205 

Val Val Ser Trp Gly Asp Gly Cys Ala Gin Arg Asn Lys Pro Gly Val 

210 215 220 

Tyr Thr Arg Leu Pro Leu Phe Arg Asp Trp He Lys Glu Asn Thr Gly 
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225 230 235 ■ 240 

Val 

<210> 5 

<211> 816 

<212> DMA 

<213> Homo Sapien 

<220> 

<221> CDS 

<222> (1) . . . (816) 

<223> Nucleotide sequence encoding MTSP25 , including 
MTSP25 protease domain 

<221> mis cofeature 
<222> (248) . . . (270) 

<223> Transmembrane domain encompasses amino acids 
248-270 at the C-terminus of the trypsin-like 
serine protease domain (amino acids 1-237) 

<400> 5 

att ata ggg ggc acc gaa gca caa get ggc gca tgg ccg tgg gtg gtg 48 

lie lie Gly Gly Thr Glu Ala Gin Ala Gly Ala Trp Pro Trp Val Val 

15 10 15 

age ctg cag att aaa tat ggc cgt gtt ctt gtt cat gta tgt ggg gga 96 
Ser Leu Gin lie Lys Tyr Gly Arg Val Leu Val His Val Cys Gly Gly 

20 25 30 

acc eta gtg aga gag agg tgg gtc etc aca get gee cac tgc act aaa 144 
Thr Leu Val Arg Glu Arg Trp Val Leu Thr Ala Ala His Cys Thr Lys 
35 40 45 

gac get age gat cct tta atg tgg aca get gtg att gga act aat aat 192 
Asp Ala Ser Asp Pro Leu Met Trp Thr Ala Val lie Gly Thr Asn Asn 
50 55 60 

ata cat gga cgc tat cct cat acc aag aag ata aaa att aaa gca ate 240 
lie His Gly Arg Tyr Pro His Thr Lys Lys He Lys He Lys Ala He 
65 70 75 " 80 

att att cat cca aac ttc att ttg gaa tct tat gta aat gat att gca 288 
lie He His Pro Asn Phe He Leu Glu Ser Tyr Val Asn Asp He Ala 

85 90 95 

ctt ttt cac tta aaa aaa gca gtg agg tat aat gac tat att cag cct 336 
Leu Phe His Leu Lys Lys Ala Val Arg Tyr Asn Asp Tyr He Gin Pro 

100 105 ' HO 

att tgc eta cct ttt gat gtt ttc caa ate ctg gac gga aac aca aag 384 
He Cys Leu Pro Phe Asp Val Phe Gin He Leu Asp Gly Asn Thr Lys 
115 12 0 125 

tgt ttt ata agt ggc tgg gga aga aca aaa gaa gaa ggt aac get aca 43 2 
Cys Phe He Ser Gly Trp Gly Arg Thr Lys Glu Glu Gly Asn Ala Thr 
130 135 140 

aat att tta caa gat gca gaa gtg cat tat att tct cga gag atg tgt 48 0 
Asn He Leu Gin Asp Ala Glu Val His Tyr He Ser Arg Glu Met Cys 
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145 

aat tct gag agg 
Asn Ser Glu Arg 



gca ggt gat gaa 
Ala Gly Asp Glu 

180 

gga cca tta atg 
Gly Pro Leu Met 
195 

gga att acc agt 
Gly lie Thr Ser 
210 

gtc tat att ggg 
Val Tyr He Gly 
225 

ttc cat gca age 
Phe His Ala Ser 



cag ate etc ata 
Gin He Leu He 

260 



150 

agt tat ggg gga ata 
Ser Tyr Gly Gly He 
165 

gat gga get ttt gat 
Asp Gly Ala Phe Asp 

185 

tgc tac tta cca gaa 
Cys Tyr Leu Pro Glu 

200 

tac gga cat ggc tgt 
Tyr Gly His Gly Cys 
215 

cca tec ttc tac caa 
Pro Ser Phe Tyr Gin 
230 

act caa ggc ata ctt- 
Thr Gin Gly He Leu 
245 

get tta tgt ttt gtc 
Ala Leu Cys Phe Val 

265 



155 

att cct aac act 
He Pro Asn Thr 
17 0 

act tgc agg ggt 
Thr Cys Arg Gly 



tat aaa aga ttt 
Tyr Lys Arg Phe 

205 

ggt cga aga ggt 
Gly Arg Arg Gly 
220 

aag tgg ctg aca 
Lys Trp Leu Thr 
235 

act ata aat att 
Thr He Asn He 
250 

ate tta eta gca 
He Leu Leu Ala 



160 

tea ttt tgt 528 
Ser Phe Cys 
175 

gac agt ggg 576 

Asp Ser Gly 

190 

ttt gta atg 624 
Phe Val Met 



ttt cct ggt 672 
Phe Pro Gly 



gag cat ttc 720 
Glu His Phe 
240 

tta cgt ggc 768 
Leu Arg Gly 
255 

aca aca taa 816 

Thr Thr * 

270 



<210> 6 

<211> 271 

<212> PRT 

<213> Homo Sapien 

<400> 6 * 

He He Gly Gly Thr Glu Ala Gin Ala Gly Ala Trp Pro Trp Val Val 

1 5 10 15 

Ser Leu Gin He Lys Tyr Gly Arg Val Leu Val His Val Cys Gly Gly 

20 25 30 

Thr Leu Val Arg Glu Arg Trp Val Leu Thr Ala Ala His Cys Thr Lys 

35 40 45 

Asp Ala Ser Asp Pro Leu Met Trp Thr Ala Val He Gly Thr Asn Asn 

50 55 60 

He His Gly Arg Tyr Pro His Thr Lys Lys He Lys He Lys Ala He 
65 70 75 80 

He He His Pro Asn Phe He Leu Glu Ser Tyr Val Asn Asp He Ala 

85 90 ~ 95 

Leu Phe His Leu Lys Lys Ala Val Arg Tyr Asn Asp Tyr He Gin Pro 

100 105 no 

He Cys Leu Pro Phe Asp Val Phe Gin He Leu Asp Gly Asn Thr Lys 

115 120 125 

Cys Phe He Ser Gly Trp Gly Arg Thr Lys Glu Glu Gly Asn Ala Thr 

130 135 140 

Asn He Leu Gin Asp Ala Glu Val His Tyr He Ser Arg Glu Met Cys 
14 5 150 155 160 

Asn Ser Glu Arg Ser Tyr Gly Gly He He Pro Asn Thr Ser Phe Cys 
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165 




Ala 


Gly Asp 


Glu 


Asp 


Gly Ala 








180 






Gly Pro 


Leu 


Met 


Cys 


Tyr Leu 






195 






Gly 


He 


Thr 


Ser 


Tyr 


Gly His 




210 








215 


Val 


Tyr 


He 


Gly 


Pro 


Ser Phe 


225 








230 


Phe 


His 


Ala 


Ser 


Thr 


Gin Gly 










245 


Gin 


He 


Leu 


He 


Ala 


Leu Cys 








260 





170 175 
Phe Asp Thr Cys Arg Gly Asp Ser Gly 

185 190 
Pro Glu Tyr Lys Arg Phe Phe Val Met 
200 205 
Gly Cys Gly Arg Arg Gly Phe Pro Gly 

22 0 

Tyr Gin Lys Trp Leu Thr Glu His Phe 

235 240 
He Leu Thr He Asn He Leu Arg Gly 

250 255 
Phe Val He Leu Leu Ala Thr Thr 
265 270 

<210> 7 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 5' end primer 
<400> 7 

gcaccgctta aggatgtgtt gcaagggtc 2 9 

<210> 8 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 3' end primer 
<400> 8 

atgctctgtc agccactttt ggtagaagg 29 

<210> 9 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR 5' end primer 
<400> 9 

ggaaacacaa agtgttttat aagtggctgg 30 

<210> 10 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR 3' end primer 
<400> 10 

cattttgtgc aggtgatgaa gatggagc ' 28 

<210> 11 
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<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR 5' end primer 
<400> 11 

aggctgaata tagtcattat acctcactgc 

<210> 12 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR 3' end primer 
<400> 12 

catcagcttc tcctgacata aatccagcac 



<210> 13 
<211> 9276 
<212> DNA 

<213> Pichia pastoris 



<400> 13 

agatctaaca tccaaagacg aaaggttgaa tgaaaccttt ttgccatccg acatccacag 60 
gtccattctc acacataagt gccaaacgca acaggagggg atacactagc agcagaccgt 12 0 

tgcaaacgca ggacctccac tcctcttctc ctcaacaccc acttttgcca tcgaaaaacc 



180 



agcccagtta ttgggcttga ttggagctcg ctcattccaa ttccttctat taggctacta 240 



acaccatgac tttattagcc tgtctatcct ggcccccctg gcgaggttca tgtttgttta 300 

tttccgaatg caacaagctc cgcattacac ccgaacatca ctccagatga gggctttctg 360 

agtgtggggt caaatagttt catgttcccc aaatggccca aaactgacag tttaaacgct 42 0 

gtcttggaac ctaatatgac aaaagcgtga tctcatccaa gatgaactaa gtttggttcg 480 

ttgaaatgct aacggccagt tggtcaaaaa gaaacttcca aaagtcgcca taccgtttgt 54 0 

cttgtttggt attgattgac gaatgctcaa aaataatctc attaatgctt agcgcagtct 60 0 

ctctatcgct tctgaacccc ggtgcacctg tgccgaaacg caaatgggga aacacccgct 660 

ttttggatga ttatgcattg tctccacatt gtatgcttcc aagattctgg tgggaatact 720 

gctgatagcc taacgttcat gatcaaaatt taactgttct aacccctact tgacagcaat 78 0 

atataaacag aaggaagctg ccctgtctta aacctttttt tttatcatca ttattagctt 840 

actttcataa tfcgcgactgg ttccaattga caagcttttg attttaacga cttttaacga 90 0 

caacttgaga agatcaaaaa acaactaatt attcgaagga tccaaacgat gagatttcct 960 

tcaattttta ctgcagtttt attcgcagca tcctccgcat tagctgctcc agtcaacact 1020 

acaacagaag atgaaacggc acaaattccg gctgaagctg tcatcggtta ctcagattta 1080 

gaaggggatt tcgatgttgc tgttttgcca ttttccaaca gcacaaataa cgggttattg 1140 

tttataaata ctactattgc cagcattgct gctaaagaag aaggggtatc tctcgagaaa 12 0 0 

agagaggctg aagcttacgt agaattccct agggcggccg cgaattaatt cgccttagac 1260 

atgactgttc ctcagttcaa gttgggcact tacgagaaga ccggtcttgc tagattctaa 132 0 

tcaagaggat gtcagaatgc catttgcctg agagatgcag gcttcatttt tgatactttt 1380 

ttatttgtaa cctatatagt ataggatttt ttttgtcatt ttgtttcttc tcgtacgagc 144 0 

ttgctcctga tcagcctatc tcgcagctga tgaatatctt gtggtagggg tttgggaaaa 1500 

tcattcgagt ttgatgtttt tcttggtatt tcccactcct cttcagagta cagaagatta 156 0 

agtgagaagt tcgtttgtgc aagcttatcg ataagcttta atgcggtagt ttatcacagt 162 0 

taaattgcta acgcagtcag gcaccgtgta tgaaatctaa caatgcgctc atcgtcatcc 168 0 

tcggcaccgt caccctggat gctgtaggca taggcttggt tatgccggta ctgccgggcc 174 0 

tcttgcggga tatcgtccat tccgacagca tcgccagtca ctatggcgtg ctgctagcgc 1800 



WO 03/031585 PCIYUS02/32417 

14/19 



tatatgcgtt gatgcaattt ctatgcgcac ccgttctcgg agcactgtcc gaccgctttg 1860 
gccgccgccc agtcctgctc gcttcgctac ttggagccac tatcgactac gcgalcatgf 1920 
cgaccacacc cgtcctgtgg atctatcgaa tctaaatgta agttaaaatc tctaaataat 1980 
taaataagtc ccagtttctc catacgaacc ttaacagcat tgcggtgagc atctaqacct 2040 
tcaacagcag ccagatccat cactgcttgg ccaatatgtt tlagtccc?c aggagfScg £ao 
tcttgtgaag tgatgaactt ctggaaggtt gcagtgttaa ctccgctgta ttfacgggca 2160 
tmil a ° 9 tt 99 caaa 9 fc gtggttggta ccggaggagt aatctccaca actctctgga 2220 
t^t™?^ " aCaa f ac agatccagcg tgttgtactt gatcaacata agaagaagca 22 80 
tcot™^ Scaggatcaa gtgttcagga gogtactgat tggacatttc caaagccigc 2340 
^^ ag9ttg caaccgatag ggttgtagag tgtgcaatac acttgcgtac aatttcaacc 2400 
^? gcaaCt 9 caca 9=ttg gttgtgaaca gcatcttcaa ttctggcaag ctccttgtct 2460 
gtcatatcga cagccaacag aatcacctgg gaatcaatac catgttcagc ttgagacaqa 2520 
aggtctgagg caacgaaatc tggatcagcg tatttatcag caataactag aacttcaqaa 2580 
ggcccagcag gcatgtcaat actacaoagg gctgatgtgt cattttgaac catcatcttq 2640 
g f 39 ° a ? taa c 9 aact ggtt tcctggacca aatattttgt cacacttagg aacagtttct 2700 

9 "f°?^ aa ? ccatagcagc tactgcctgg gcgoctcctg ctagcacgat acacttagca 2760 

Hl^tl 9t 9g3 ? aaC ?B a 9 at 9 ac ttct ggggtaaggg taccatcctt cttaggtgga 2820 

gatgcaaaaa caatttcttt gcaaccagca actttggcag gaacacccag catcagggaa 2880 

gtggaaggca gaattgcggt tccaccagga atatagaggc caactttctc aataqqtctt 2940 

gcaaaacgag agcagactac accagggcaa gtctcaactt gcaacgtctc cgttaqttqa 3000 

lllllltlll atttcctgac gttatctata gagagatcaa tggctctctt alcgtlatlt 3060 

ggcaattgca taagttcctc tgggaaagga gcttctaaca caggtgtctt caaagcgact 3120 

ccatcaaact tggcagttag ttctaaaagg gctttgtcac caEttEgacg aaca^tgtcg 3180 

a " t a "f? ^ gaC ^ aattc ca taatotgt tccgttttct ggataggacg acgaagggc! 3240 

tcttcaattt cttgtgagga ggccttagaa acgtcaattt tgcacaattc aatacgacct 3300 

C "r^ a9 ? ^^Sattct tctttaggtt gttccttggt gtatcctggc 3360 

ttggcatctc ctttccttct agtgacottt agggacttca tatccaggtt tototccacc 342 0 

tcgtccaacg tcacaccgta cttggcacat ctaactaatg caaaataaaa taagtcagca 3480 

cattcccagg ctatatcttc cttggattta gcttctgcaa gttcatcagc ttcctcccta 3540 

^=^ a ?^ ); caacaaaac ttcgtcgtca aataaccgtt tggtataaga accttctgga 3600 

lltttf, tacgatccca caaggtggct tccatggctc taagaccctt tgattggcca 3660 

aaa ^ 9 ? 39 fc 9=gtt C caa gtgacagaaa ccaacacctg tttgttcaac cacaaatttc 3720 

aa ?° ag ^ t ° catcacaatc caattcgata cccagcaact tttgagttgc tccagatgta 3780 

g S a ? accacaaacc gtgacgacga gattggtaga ctccagtttg tgtccttata 3840 

f^=^f 9aa Jagacttttt ggacgagtac accaggccca acgagtaatt agaagagtca 3 900 

gccaccaaag tagtgaatag accatcgggg cggtcagtag tcaaagacgc caacaaaatt 3960 

ao a ?n? a ? a ? g 9 aact "tt gacatcttca gaaagttogt attcagtagt caattgccga 4020 

gcatcaataa tggggattat accagaagca acagtggaag tcacatctac caactttgcg 4080 

gtctcagaaa aagcataaac agttctacta ccgccattag tgaaactttt caaatcgccc 4140 

agtggagaag aaaaaggcac agcgatacta gcattagcgg gcaaggatgc aactttatca 4200 

a " ag9gtc ° tafca 9ataac cctagcgcct gggatcatcc tttggacaac tctttctgcc 4260 

aaatctaggt ccaaaatcac ttcattgata ccattattgt acaacttgag caagttqtcq 4320 

lltlf^tlt caaatfc 99tc ctctgtaacg gatgactcaa cttgcacatt aac?tgaagc 43 80 

^ a ?^ 9a " 9 a 9t9 aa =tt gatcaggttg tgcagctggt cagcagcata gggaaacalg 4440 

gcttttccta ccaaactcaa ggaattatca aactctgcaa cacttgcgta tgcaggtagc 45 00 

ta??? aaa ^? t ^ ata = tt 9 a a 9tcggacag tgagtgtagt cttgafalat tJtgllgclg tseo 

catttttatt atcagtgagt cagtcatcag gagatcctct acgccggacg catcgtgqcc 4620 

tac^f^?? 9939999999 gcgctgaggt ctgcctcgtg aagaaggtgt tgctgactca 4680 

t*ttl*fl C l gaatcgcccc atcatccagc cagaaagtga gggagccacg gttgatgaga 4740 

gctttgttgt aggtggacca gttggtgatt ttgaactttt gctttgccac ggaacgqtlt 4800 

™^ 99 gaagat ?cgt gatctgatcc ttcaactcag laaaagttcg at^af^caa 4860 

caaagccgcc gtcccgtcaa gtcagcgtaa tgctctgcca gtgttacaac caattaacca 4920 

tatcaf * ? aaaaa ^ Ca tc 9 a g= a t= a aatgaaactg caltttattc atatcaggat llll 

tatcaatacc atatttttga aaaagccgtt tctgtaatga aggagaaaac tcaccgaggc 5040 

tlllmttl 9atggcaaga tcctggtatc ggtctgcgat tccgactcgt ccaacltlla 5100 

t^ aa ^ a = taattt J ccc tcgtcaaaaa taaggttatc aagtgagaaa tcaccatgag 5160 

ef™?™^* a ^ cc 9gtgag aatggcaaaa gcttatgcat ttctttccag acttgttlai 5220 

caggccagcc atcacgcncg tcatcaaaat cactcgcatc aaccaaaccg ttattcattc 52 80 

gtgattgcgc ctgagcgaga cgaaatacgc gatcgctgtt aaaaggacaa ttacaaacag 5340 
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gaatcgaatg caaccggcgc aggaacactg ccagcgcatc aacaatattt tcacctgaat 5400 

caggatattc ttctaatacc tggaatgctg ttttcccggg gatcgcagtg gtgagtaacc 5460 

atgcatcatc aggagtacgg ataaaatgcfc tgatggtcgg aagaggcata aattccgtca 552 0 

gccagtttag tctgaccatc tcatctgtaa catcattggc aacgctacct ttgccatgtt 5580 

tcagaaacaa ctctggcgca tcgggcttcc catacaatcg atagattgtc gcacctgatt 5640 

gcccgacatt atcgcgagcc catttatacc catataaatc agcatccatg ttggaattta 5700 

atcgcggcct cgagcaagac gtttcccgtt gaatatggct cataacaccc cttgtattac 5760 

tgtttatgta agcagacagt tttattgttc atgatgatat atttttatct tgtgcaatgt 5820 

aacatcagag attttgagac acaacgtggc tttccccccc ccccctgcag gtcggcatca 5880 

ccggcgccac aggtgcggtt gctggcgcct atatcgccga catcaccgat ggggaagatc 5940 

gggctcgcca cttcgggctc atgagcgctt gtttcggcgt gggtatggtg gcaggccccg 6000 

tggccggggg actgttgggc gccatctcct tgcatgcacc attccttgcg gcggcggtgc 6060 

tcaacggcct caacctacta ctgggctgct tcctaatgca ggagtcgcat aagggagagc 6120 

gtcgagtatc tatgattgga agtatgggaa tggtgatacc cgcattcttc agtgtcttga 6180 

ggtctcctat cagattatgc ccaactaaag caaccggagg aggagatttc atggtaaatt 6240 

tctctgactt ttggtcatca gtagactcga actgtgagac tatctcggtt atgacagcag 63 00 

aaatgtcctt cttggagaca gtaaatgaag tcccaccaat aaagaaatcc ttgttatcag 63 60 

gaacaaactt cttgtttcga actttttcgg tgccttgaac tataaaatgt agagtggata 6420 

tgtcgggtag gaatggagcg ggcaaatgct taccttctgg accttcaaga ggtatgtagg 64 80 

gtttgtagat actgatgcca acttcagtga caacgttgct atttcgttca aaccattccg 6540 

aatccagaga aatcaaagtt gtttgtctac tattgatcca agccagtgcg gtcttgaaac 6600 

tgacaatagt gtgctcgtgt tttgaggtca tctttgtatg aataaatcta gtctttgatc* 6660 

taaataatct tgacgagcca aggcgataaa tacccaaatc taaaactctt ttaaaacgtt 6720 

aaaaggacaa gtatgtctgc ctgtattaaa ccccaaatca gctcgtagtc tgatcctcat 6780 

caacttgagg ggcactatct tgttttagag aaatttgcgg agatgcgata tcgagaaaaa 6840 

ggtacgctga ttttaaacgt gaaatttatc tcaagatctc tgcctcgcgc gtttcggtga 6900 

tgacggtgaa aacctctgac acatgcagct cccggagacg gtcacagctt gtctgtaagc 6960 

ggatgccggg agcagacaag cccgtcaggg cgcgtcagcg ggtgttggcg ggtgtcgggg 7020 

cgcagccatg acccagtcac gtagcgatag cggagtgtat actggcttaa ctatgcggca 7080 

tcagagcaga ttgtactgag agtgcaccat atgcggtgtg aaataccgca cagatgcgta 7140 

aggagaaaat accgcatcag gcgctcttcc gcttcctcgc tcactgactc gctgcgctcg 7200 

gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg cggtaatacg gttatccaca 72 60 

gaatcagggg ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac 7320 

cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc gcccccctga cgagcatcac 73 80 

aaaaatcgac gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg 7440 

tttccccctg gaagctccct cgtgcgctct cctgttccga ccctgccgct taccggatac 7500 

ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc aatgctcacg ctgtaggtat 7560 

ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag 7620 

cccgaccgct gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac 7680 

ttatcgccac tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt 7740 

gctacagagt tcttgaagtg gtggcctaac tacggctaca ctagaaggac agtatttggt 7800 

atctgcgctc tgctgaagcc agttaccttc ggaaaaagag ttggtagctc ttgatccggc 7860 

aaacaaacca ccgctggtag cggtggtttt tttgtttgca agcagcagat tacgcgcaga 792 0 

aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc tcagtggaac 7980 

gaaaactcac gttaagggat tttggtcatg agattatcaa aaaggatctt cacctagatc 8040 

cttttaaatt aaaaatgaag ttttaaatca atctaaagta tatatgagta aacttggtct 8100 

gacagttacc aatgcttaat cagtgaggca cctatctcag cgatctgtct atttcgttca 8160 

tccatagttg cctgactccc cgtcgtgtag ataactacga tacgggaggg cttaccatct. 8220 

ggccccagtg ctgcaatgat accgcgagac ccacgctcac cggctccaga tttatcagca 8280 

ataaaccagc cagccggaag ggccgagcgc agaagtggtc ctgcaacttt atccgcctcc 8340 

atccagtcta ttaattgttg ccgggaagct agagtaagta gttcgccagt taatagtttg 84 0 0 

cgcaacgttg ttgccattgc tgcaggcatc gtggtgtcac gctcgtcgtt tggtatggct 8460 

tcattcagct ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa 8520 

aaagcggtta gctccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta 8580 

tcactcatgg ttatggcagc actgcataat tctcttactg tcatgccatc cgtaagatgc 8640 

Zr^ Ct9tga ct S9tgagta ctcaaccaag tcattctgag aatagtgtat gcggcgaccg 8700 

agttgctctt gcccggcgtc aacacgggat aataccgcgc cacatagcag aactttaaaa 87 60 

gtgcucatca uugyaaaacg ttcttcgggg cgaaaactct caaggatctt accgctgttg 8820 

agatccagtt cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc 88 80 



WO 03/031585 



16/19 



PCI7US02/32417 



accagcgttt ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg 894 0 

gcgacacgga aatgttgaat actcatactc ttcctttttc aatattattg aagcatttat 9000 

cagggttatt gtctcatgag cggatacata tttgaatgta tttagaaaaa taaacaaata 9060 

ggggttccgc gcacatttcc ccgaaaagtg ccacctgacg tctaagaaac cattattatc 9120 

atgacattaa cctataaaaa taggcgtatc acgaggccct ttcgtcttca agaattaatt 9180 

ctcatgtttg acagcttatc atcgataagc tgactcatgt tggtattgtg aaatagacgc 924 0 

agatcgggaa cactgaaaaa taacagttat tattcg ~* ~~ 927 6 



<210> 14 
<211> 11 
<212> PRT 

<213> :Pichia protease cleavage site 
<400> . 14 

Lys Arg lie Ala Ser Gly Val lie Ala Pro Lys 
15 10 



<210> ,15 

<211> 1047 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . . (1044) 

<223> MTSP25 cDNA 

<400> 15 

atg egg ctg' ggg etc ctg age gtg gcg ctg ttg ttt gtg ggg age tct 4 8 

Met Arg Leu Gly Leu Leu Ser Val Ala Leu Leu Phe Val Gly Ser Ser 
1 5 10 15 

cac tta tac tea gac cac tac teg ccc tct gga agg cac agg etc ggc 96 
His Leu Tyr* Ser Asp His Tyr Ser Pro Ser Gly Arg His Arg Leu Gly , 

20 25 30 

ccc teg ccg gaa ccg gcg get agt tec cag cag get gag gee gtc cgc 144 
Pro Ser Pro. Glu Pro Ala Ala Ser Ser Gin Gin Ala Glu Ala Val Arg 
35 40 45 

aag agg etc egg egg egg agg gag gga ggg gcg cat gca aag gat tgt 192 
Lys Arg Leu Arg Arg Arg Arg Glu Gly Gly Ala His Ala Lys Asp Cys 
50 55 60 

gga aca gca ccg ctt aag gat gtg ttg caa ggg tct egg att ata ggg 240 
Gly Thr Ala Pro Leu Lys Asp Val Leu Gin Gly Ser Arg lie He Gly 
65 70 75 80 

ggc ace gaa gca caa get ggc gca tgg ccg tgg gtg gtg age ctg cag 2 88 

Gly Thr Glu Ala Gin Ala Gly Ala Trp Pro Trp Val Val Ser Leu Gin 

85 90 95 

att aaa tat ggc cgt gtt ctt gtt cat gta tgt ggg gga acc eta gtg 3 36 

He Lys Tyr Gly Arg Val Leu Val His Val Cys Gly Gly Thr Leu Val 

100 105 no 

aga gag agg tgg gtc etc aca get gee cac tgc act aaa gac get age 3 84 

Arg Glu Arg Trp Val Leu Thr Ala Ala His Cys Thr Lys Asp Ala Ser 
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528 



720 



115 120 125 

gat cct tta atg tgg aca get gtg att gga act aat aat ata cat gga 432 
Asp Pro Leu Met Trp Thr Ala Val lie Gly Thr Asn Asn He His Gly 
13 0 135 140 

cgc tat cct cat acc aag aag ata aaa att aaa gca ate att att cat 480 
Arg Tyr Pro His Thr Lys Lys He Lys He Lys Ala He He He His 
145 150 155 160 

cca aac ttc att ttg gaa tct tat gta aat gat att gca ctt ttt cac 
Pro Asn Phe He Leu Glu Ser Tyr Val Asn Asp He Ala Leu Phe His 

165 170 175 

tta aaa aaa gca gtg agg tat aat gac tat att cag cct att tgc eta 576 
Leu Lys Lys Ala Val Arg Tyr Asn Asp Tyr He Gin Pro He Cys Leu 

180 185 190 

cct ttt gat gtt ttc caa ate ctg gac gga aac aca aag tgt ttt ata 624 
Pro Phe Asp Val Phe Gin He Leu Asp Gly Asn Thr Lys Cys Phe He 
195 200 205 

agt ggc tgg gga aga aca aaa gaa gaa ggt aac get aca aat att tta 672 
Ser Gly Trp Gly Arg Thr Lys Glu Glu Gly Asn Ala Thr Asn He Leu 
210 215 220 

caa gat gca gaa gtg cat tat att tct cga gag atg tgt aat tct gag 
Gin Asp Ala Glu Val His Tyr He Ser Arg Glu Met Cys Asn Ser Glu 
225 230 235 240 

agg agt tat ggg gga ata att cct aac act tea ttt tgt gca ggt gat 768 
Arg Ser Tyr Gly Gly He He Pro Asn Thr Ser Phe Cys Ala Gly Asp 

245 250 255 

gaa gat gga get ttt gat act tgc agg ggt gac agt ggg gga cca tta 816 
Glu Asp Gly Ala Phe Asp Thr Cys Arg Gly Asp Ser Gly Gly Pro Leu 

260 265 270 

atg tgc tac tta cca gaa tat aaa aga ttt ttt gta atg gga att acc 864 
Met Cys Tyr Leu Pro Glu Tyr Lys Arg Phe Phe Val Met Gly He Thr 
275 280 285 

agt tac gga cat ggc tgt ggt cga aga ggt ttt cct ggt gtc tat att 912 
Ser Tyr Gly His Gly Cys Gly Arg Arg Gly Phe Pro Gly Val Tyr He 
290 295 300 

ggg cca tec ttc tac caa aag tgg ctg aca gag cat ttc ttc cat gca 
Gly Pro Ser Phe Tyr Gin Lys Trp Leu Thr Glu His Phe Phe His Ala 
305 310 315 320 

age act caa ggc ata ctt act ata aat att tta cgt ggc caa ate etc 
Ser Thr Gin Gly He Leu Thr He Asn He Leu Arg Gly Gin He Leu 

325 330 335 

ata get tta tgt ttt gtc ate tta eta gca aca aca taa 
He Ala Leu Cys Phe Val He Leu Leu Ala Thr Thr 

340 345 



960 



1008 



1047 
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<210> 16 
<211> 348 
<212> PRT 

<213> Homo sapiens 
<400> 16 

Met Arg Leu Gly Leu Leu Ser Val Ala Leu Leu Phe Val Gly Ser Ser 

1 5 io 15 

His Leu Tyr Ser Asp His Tyr Ser Pro Ser Gly Arg His Arg Leu Gly 

20 25 30 

Pro Ser Pro Glu Pro Ala Ala Ser Ser Gin Gin Ala Glu Ala Val Arq 

35 .40 45 
Lys Arg Leu Arg Arg Arg Arg Glu Gly Gly Ala His Ala Lys Asp Cys 

50 55 60 

Gly Thr Ala Pro Leu Lys Asp Val Leu Gin Gly Ser Arg He He Glv 
65 70 75 80 

Gly Thr Glu Ala Gin Ala Gly Ala Trp Pro Trp Val Val Ser Leu Gin 

85 90 95 

He Lys Tyr Gly Arg Val Leu Val His Val Cys Gly Gly Thr Leu Val 

100 105 no 

Arg Glu Arg Trp Val Leu Thr Ala Ala His Cys Thr Lys Asp Ala Ser 

115 120 125 

Asp Pro Leu Met Trp Thr Ala Val He Gly Thr Asn Asn He His Glv 

130 135 140 

Arg Tyr Pro His Thr Lys Lys He Lys He Lys Ala He He He His 
145 150 155 160 

Pro Asn Phe lie Leu Glu Ser Tyr Val Asn Asp He Ala Leu Phe His 

165 170 175 

Leu Lys Lys Ala Val Arg Tyr Asn Asp Tyr He Gin Pro He Cys Leu 

180 185 190 

Pro Phe Asp Val Phe Gin He Leu Asp Gly Asn Thr Lys Cys Phe He 

195 200 205 

Ser Gly Trp Gly Arg Thr Lys Glu Glu Gly Asn Ala Thr Asn He Leu 

210 215 220 

Gin Asp Ala Glu Val His Tyr He Ser Arg Glu Met Cys Asn Ser Glu 
225 230 235 240 

Arg Ser Tyr Gly Gly He He Pro Asn Thr Ser Phe Cys Ala Glv Aso 

245 250 255 

Glu Asp Gly Ala Phe Asp Thr Cys Arg Gly Asp Ser Gly Gly Pro Leu 

260 265 270 

Met Cys Tyr Leu Pro Glu Tyr Lys Arg Phe Phe Val Met Gly He Thr 

275 280 285 

Ser Tyr Gly His Gly Cys Gly Arg Arg Gly Phe Pro Gly Val Tyr He 

2 90 295 30 0 

Gly Pro Ser Phe Tyr Gin Lys Trp Leu Thr Glu His Phe Phe His Ala 
305 310 315 320 

Ser Thr Gin Gly He Leu Thr He Asn He Leu Arg Gly Gin He Leu 

325 330 ~ " 335 

He Ala Leu Cys Phe Val He Leu Leu Ala Thr Thr 

340 345 



<210> 17 

<211> 30 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> primer 
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<400> 17 

cacagctgtc cacattaaag gatcgctagc 

<210> 18 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> start -codon- containing fragment 

<400> 18 
tccaaaatgc gg 



30 

of MTSP2 5 -encoding nucleic acid 

12 



